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Preface to the Second Edition 


In 2013 the world mathematical community is celebrating the 100th anniver- 
sary of Paul Erd6és’ birth. His personality is remembered by many of his 
friends, former disciples, and over 500 coauthors, and his mathematics is as 
alive and well as if he was still among us. In 1995/1996 we were preparing 
the two volumes of The Mathematics of Paul Erdés not only as a tribute to 
the achievements of one of the great mathematicians of the twentieth century 
but also to display the full scope of his ceuvre, the scientific activity which 
transcends individual disciplines and covers a large part of mathematics as 
we know it today. We did not want to produce just a “festschrift” . 

In 1995/1996 this was a reasonable thing to do since most people were 
aware of the (non-decreasing) Erdés activity only in their own particular area 
of research. For example, we combinatorialists somehow have a tendency to 
forget that the main activity of Erdés was number theory. 

In the busy preparation of the volumes we did not realize that at the end, 
when published, our volumes could be regarded as a tribute, as one of many 
obituaries and personal recollections which flooded the scientific (and even 
mass) media. It had to be so; the old master left. 

Why then do we think that the second edition should be published? 
Well, we believe that the quality of individual contributions in these volumes 
is unique, interesting and already partly historical (and irreplaceable— 
particularly in Part I of the first volume). Thus it should be updated and 
made available especially in this anniversary year. This we feel as our duty 
not only to our colleagues and authors but also to students and younger 
scientists who did not have a chance to meet the wandering scholar personally. 
We decided to prepare a second edition, asked our authors for updates and 
in a few instances we solicited new contributions in exciting new areas. The 
result is then a thoroughly edited volume which differs from the first edition 
in many places. 

On this occasion we would like to thank all our authors for their time and 
work in preparing their articles and, in many cases, modifying and updating 
them. We are fortunate that we could add three new contributions: one by 
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Joel Spencer (in the way of personal introduction), one by Larry Guth in 
Part IV of the first volume devoted to geometry, and one by Alexander 
Razborov in Part I of the second volume devoted to extremal and Ramsey 
problems. We also wish to acknowledge the essential contributions of Steve 
Butler who assisted us during the preparation of this edition. In fact Steve’s 
contributions were so decisive that we decided to add him as co-editor to 
these volumes. We also thank Kaitlin Leach (Springer) for her efficiency and 
support. With her presence at the SIAM Discrete Math. conference in Halifax, 
the whole project became more realistic. 

However, we believe that these volumes deserve a little more contem- 
plative introduction in several respects. The nearly 20 years since the first 
edition was prepared gives us a chance to see the mathematics of Paul 
Erdos in perspective. It is easy to say that his mathematics is alive; that 
may sound cliché. But this is in fact an understatement for it seems that 
Erdos’ mathematics is flourishing. How much it changed since 1995 when 
the first edition was being prepared. How much it changed in the wealth 
of results, new directions and open problems. Many new important results 
have been obtained since then. To name just a few: the distinct distances 
problem, various bounds for Ramsey numbers, various extremal problems, the 
empty convex 6-gon problem, packing and covering problems, sum-product 
phenomena, geometric incidence problems, etc. Many of these are covered by 
articles of this volumes and many of these results relate directly or indirectly 
to problems, results and conjectures of Erdés. Perhaps it is not as active 
a business any more to solve a particular Erdés problem. After all, the 
remaining unsolved problems from his legacy tend to be the harder ones. 
However, many papers quote his work and in a broader sense can be traced 
to him. 

There may be more than meets the eye here. More and more we see that 
the Erdos problems are attacked and sometimes solved by means of tools that 
are not purely combinatorial or elementary, and which originate in the other 
areas of mathematics. And not only that, these connections and applications 
merge to new theories which are investigated on their own and some of which 
belong to very active areas of contemporary mathematics. As if the hard 
problems inspire the development of new tools which then became a coherent 
group of results that may be called theories. This phenomenon is known to 
most professionals and was nicely described by Tim Gowers as two cultures. 
[W. T. Gowers, The two cultures of mathematics, in Mathematics: Frontiers 
and Perspectives (Amer. Math. Soc., Providence, RI, 2000), 65—78.] On one 
side, problem solvers, on the other side, theory builders. Erdés’ mathematics 
seems to be on one side. But perhaps this is misleading. As an example, 
see the article in the first volume Unexpected applications of polynomials 
in combinatorics by Larry Guth and the article in the second volume Flag 
algebras: an interim report by Alexander Razborov for a wealth of theory 
and structural richness. Perhaps, on the top level of selecting problems and 
with persistent activity in solving them, the difference between the two sides 
becomes less clear. (Good) mathematics presents a whole. 
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Time will tell. Perhaps one day we shall see Paul Erdés not as a theory 
builder but as a man whose problems inspired a wealth of theories. 

People outside of mathematics might think of our field as a collection 
of old tricks. The second edition of mathematics of Paul Erdés is a good 
opportunity to see how wrong this popular perception of mathematics is. 


La Jolla, USA R.L. Graham 
Prague, Czech Republic J. NeSsetril 
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IN MEMORIAM 


Paul Erdos 


26.3.1913-20.9.1996 


The week before these volumes were scheduled to go to press, we learned that 
Paul Erdos died on September 20, 1996. He was 83. Paul died while attending 
a conference in Warsaw, on his way to another meeting. In this respect, this 
is the way he wanted to “leave”. In fact, the list of his last month’s activities 
alone inspires envy in much younger people. 

Paul was present when the completion of this project was celebrated 
by an elegant dinner in Budapest for some of the authors, editors and 
Springer representatives attending the European Mathematical Congress. He 
was especially pleased to see the first copies of these volumes and was perhaps 
surprised (as were the editors) by the actual size and impact of the collection 
(On the opposite page is the collection of signatures from those present at 
the dinner, taken from the inside cover of the mock-up for these volumes). 
We hope that these volumes will provide a source of inspiration as well as a 
last tribute to one of the great mathematicians of our time. And because of 
the unique lifestyle of Paul Erd6s, a style which did not distinguish between 
life and mathematics, this is perhaps a unique document of our times as well. 


R.L. Graham 
J. Negetiil 
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In 1992, when Paul Erdés was awarded a Doctor Honoris Causa by Charles 
University in Prague, a small conference was held, bringing together a distin- 
guished group of researchers with interests spanning a variety of fields related 
to Erdés’ own work. At that gathering, the idea occurred to several of us 
that it might be quite appropriate at this point in Erdos’ career to solicit a 
collection of articles illustrating various aspects of Erdés’ mathematical life 
and work. The response to our solicitation was immediate and overwhelming, 
and these volumes are the result. 

Regarding the organization, we found it convenient to arrange the papers 
into six chapters, each mirroring Erdés’ holistic approach to mathematics. 
Our goal was not merely a (random) collection of papers but rather a 
thoroughly edited volume composed in large part by articles explicitly 
solicited to illustrate interesting aspects of Erdds and his life and work. 
Each chapter includes an introduction which often presents a sample of 
related Erdés’ problems “in his own words”. All these (sometimes lengthy) 
introductions were written jointly by editors. 

We wish to thank the nearly 70 contributors for their outstanding efforts 
(and their patience). In particular, we are grateful to Béla Bollobas for his 
extensive documentation of Paul Erd6s’ early years and mathematical high 
points; our other authors are acknowledged in their respective chapters. We 
also want to thank A. Bondy, G. Hahn, I. Ouhel, K. Marx, J. Naceradsky 
and Ché Graham for their help and for the use of their works. At various 
stages of the project, the book was supported by AT&T Bell Laboratories, 
GACR 2167 and GAUK 351. We also are indebted to Dr. Joachim Heinze 
and Springer Verlag for their encouragement and support. Finally, we would 
like to record our extreme debt to Susan Pope (at AT&T Bell Laboratories) 
who somehow (miraculously) managed to convert more than 50 manuscripts 
of all types into the attractive form they now have. 

Here then is a unique portrait of a man who has devoted his whole being 
to “proving and conjecturing” and to the pursuit of mathematical knowledge 
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and understanding. We hope that this will form a lasting tribute to one of 
the great mathematicians of our time. 


Murray Hill, USA R.L. Graham 
Praha, Czech Republic J. Nesetiil 
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Paul Erdos lecturing. 


Photo by George Csicsery. 
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Photo by Gena Hahn. 
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I. Combinatorics and Graph Theory 


Introduction 


Erdos’ work in graph theory started early and arose in connection with D. 
Konig, his teacher in prewar Budapest. The classic paper of Erdés’ and 
Szekeres from 1935 also contains a proof in “graphotheoretic terms.” The 
investigation of the Ramsey function led Erdés to probabilistic methods and 
seminal papers in 1947, 1958 and 1960. It is perhaps interesting to note 
that three other very early contributions of Erdés’ to graph theory (before 
1947) were related to infinite graphs: infinite Eulerian graphs (with Gallai 
and Vaszoni) and a paper with Kakutani on nondenumerable graphs (1943). 
Although the contributions of Erdés to graph theory are manifold, and he 
proved (and always liked) beautiful structural results such as the Friendship 
Theorem (jointly with V. T. Sdés and Ko6vari), and compactness results 
(jointly with N. G. de Bruijn), his main contributions were in asymptotic 
analysis, probabilistic methods, bounds and estimates. Erdés was the first 
who brought to graph theory the experience and rigor of number theory 
(perhaps being preceded by two papers by V. Jarnik, one of his early 
coauthors). Thus he contributed in an essential way to lifting graph theory 
up from the “slums of topology.” 

This part contains a “special” problem paper not by Erdés but by his 
frequent coauthors from Memphis: R. Faudree, C. C. Rousseau and R. Schelp 
(well, there is actually an Erdés supplement there as well). We encouraged 
the authors to write this paper and we are happy to include it in this 
volume. This part also includes two papers coauthored by Béla Bollobas, 
who is one of Erdos’ principal disciples. Bollobaés contributed to much 
of Erdés’ combinatorial activities and wrote important books about them 
(Extremal Graph Theory, Introduction to Graph Theory, Random Graphs). 
His contributions to this chapter (coauthored with his two former students G. 
Brightwell and A. Thomason) deal with graphs (and thus are in this chapter) 
but they by and large employ random graph methods (and thus they could be 
also be at home in the other volume). The main questions there may be also 
considered as extremal graph theory questions (and thus they could fit into 
the following part). Other contributions to this chapter, which are related to 
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some aspect of Erdés’ work or simply pay tribute to him are by N. Alon, Z. 
Fiuiredi, M. Aigner and E. Triesch, S. Bezrukov and K. Engel, A. Gydrfas, 
S. Brandt, N. Sauer and H. Wang, H. Fleischner and M. Stiebitz, and D. 
Beaver, S. Haber and P. Winkler. 

In 1995/1996, when the contents of these volumes were already crystal- 
lizing, we asked Paul Erddés to isolate a few problems, both recent and old, 
for each of the eight main parts of this book. To this part on Combinatorics 
and Graph Theory he contributed the following collection of problems and 
comments. 


Erdés in his own words 


Many years ago I proved by the probability method that for every k and r 
there is a graph of girth > r and chromatic number > k. Lovasz when he was 
still in high school found a fairly difficult constructive proof. My proof still 
had the advantage that not only was the chromatic number of G(n) large but 
the largest independent set was of size < en for every € > 0 if n > no(e,7r, k). 
NeSetril and V. Rodl later found a simpler constructive proof. 

There is a very great difference between a graph of chromatic number No 
and a graph of chromatic number > &;. Hajnal and I in fact proved that if 
G has chromatic number &; then G must contain a Cy and more generally 
G contains the complete bipartite graph K(n,%1) for every n < No. Hajnal, 
Shelah and I proved that every graph G of chromatic number Xj must contain 
for some ko every odd cycle of size > ko (for even cycles this was of course 
contained in our result with Hajnal), but we observed that for every k and 
every m there is a graph of chromatic number m which contains no odd cycle 
of length < k. Walter Taylor has the following very beautiful problem: Let 
G be any graph of chromatic number Nj. Is it true that for every m > Xy 
there is a graph G,, of chromatic number m all finite subgraphs of which 
are contained in G? Hajnal and Komjath have some results in this direction 
but the general conjecture is still open. If it would have been my problem, I 
certainly would offer 1,000 dollars for a proof or a disproof. (To avoid financial 
ruin I have to restrict my offers to my problems.) 

Let k be fixed and n > oo. Is it true that there is an f(k) so that if G(n) 
has the property that for every m every subgraph of m vertices contains an 
independent set of size m/2—k then G(n) is the union of a bipartite graph 
and a graph of < f(k) vertices, i.e., the vertex set of G(n) is the union of three 
disjoint sets $1, Sz and S3 where S$) and S2 are independent and |S3| < f(k). 
Gyarfas pointed out that even the following special case is perhaps difficult. 
Let m be even and assume that every m vertices of our G(n) induces an 
independent set of size at least m/2. Is it true then that G(n) is the union of 
a bipartite graph and a bounded set? Perhaps this will be cleared up before 
this paper appears, or am I too optimistic? 

Hajnal, Szemerédi and I proved that for every € > 0 there is a graph of 
infinite chromatic number for which every subgraph of m vertices contains 
an independent set of size (1 — €)m/2 and in fact perhaps (1 — «)m/2 can be 
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replaced by m/2— f(m) where f(m) tends to infinity arbitrarily slowly. A 
result of Folkman implies that if G is such that every subgraph of m vertices 
contains an independent set of size m/2—k then the chromatic number of G 
is at most 2k + 2. 

Many years ago Hajnal and I conjectured that if G is an infinite graph 
whose chromatic number is infinite, then if a, < a2 < ... are the lengths of 
the odd cycles of G we have 


1 
2 
u 
and perhaps a1 < a2 < ... has positive upper density. (The lower density can 
be 0 since there are graphs of arbitrarily large chromatic number and girth.) 

We never could get anywhere with this conjecture. About 10 years ago 
Mih6ék and I conjectured that G must contain for infinitely many n cycles 
of length 2”. More generally it would be of interest to characterize the 
infinite sequences A = {a, < ag < ...} for which every graph of infinite 
chromatic number must contain infinitely many cycles whose length is in A. 
In particular, assume that the a; are all odd. 

All these problems are unattackable (at least for us). About 3 years ago 
Gyarfas and I thought that perhaps every graph whose minimum degree is 
> 3 must contain a cycle of length 2" for some k > 2. We became convinced 
that the answer almost surely will be negative but we could not find a 
counterexample. We in fact thought that for every r there must be a G, 
every vertex of which has degree > r and which contains no cycle of length 
2* for any k > 2. The problem is wide open. 

Gyarfas, Komlds and Szemerédi proved that if k is large and a, < ag <... 
are the lengths of the cycles of a G(n, kn), that is, an n-vertex graph with 
kn edges, then 


S- + > clogn. 


The sum is probably minimal for the complete bipartite graphs. 

(Erdés-Hajnal) If G has large chromatic number does it contain two (or k& 
if the chromatic number is large) edge-disjoint cycles having the same vertex 
set? It surely holds if G(n) has chromatic number > n* but nothing seems to 
be known. 

Fajtlowicz, Staton and I considered the following problem (the main 
idea was due to Fajtlowicz). Let F(n) be the largest integer for which 
every graph of n vertices contains a regular induced subgraph of > F'(n) 
vertices. Ramsey’s theorem states that G(n) contains a trivial subgraph, i-e., 
a complete or empty subgraph of clog n vertices. (The exact value of c is not 
known but we know 1/2 < c < 2.) We conjectured F'(n)/logn — ov. This is 
still open. We observed F'(5) = 3 (since if G(5) contains no trivial subgraph 
of 3 vertices then it must be a pentagon) . Kohayakawa and I worked out 
the F'(7) = 4 but the proof is by an uninteresting case analysis. (We found 
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that this was done earlier by Fajtlowicz, McColgan, Reid and Staton, see Ars 
Combinatoria vol 39.) It would be very interesting to find the smallest integer 
n for which F'(n) = 5, ie., the smallest n for which every G(n) contains a 
regular induced subgraph of > 5 vertices. Probably this will be much more 
difficult than the proof of F'(7) = 4 since in the latter we could use properties 
of perfect graphs. Bollobds observed that F'(n) < c/n for some c > 0. 

Let G(10n) be a graph on 10n vertices. Is it true that if every index 
subgraph of 5n vertices of our G(10n) has > 2n? + 1 edges then our G(10n) 
contains a triangle? It is easy to see that 2n? edges do not suffice. A 
weaker result has been proved by Faudree, Schelp and myself at the Hakone 
conference (1992, I believe) see also a paper by Fan Chung and Ron Graham 
(one of the papers in a volume published by Bollobas dedicated to me). 

A related forgotten conjecture of mine states that if our G(10n) has more 
than 20n? edges and every subgraph of 5n vertices has > 2n? edges then 
our graph must have a triangle. Simonovits noticed that if you replace each 
vertex of the Petersen graph by n vertices you get a graph of 10n vertices, 
15n? edges, no triangle and every subgraph of 5n vertices contains > 2n? 
edges. 
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So much for P. Erdés in 1995. Let us add that since that time some of 
these problems were solved, some are open and some seem to be dormant. 
Some were subject of intensive study. The reference to the above Hakone 
conference is: 

P. Erdés, R. J. Faudree, C. C. Rousseau, R. H. Schelp, A local density 
condition for triangles, Discrete Math. 127, 1-2 (1994), 153-161. 

(The conference was The Second Japan Conference on Graph Theory and 
Combinatorics, Aug 18-22, 1990 in Hakone.) 

The mentioned paper by Fan Chung et al. is the following: 

F. R. K. Chung and R. L. Graham, On graphs not containing prescribed 
induced subgraphs, in A Tribute to Paul Erdos, ed. by A. Baker, B. Bollobas 
and A. Hajnal, Cambridge University Press (1990), 111-120. 

One of these problems was quoted by Erdés much earlier. For example the 
problem of Taylor was mentioned as early as 1975; (W. Taylor: Problem 42. 
In: Combinatorial Structures and Their Applications, Proc. Calgary Internat. 
Conf. 1969, Gordon and Breach 1969.) 

For more information about Erdés problems on graphs and of their 
current status see: 

F. R. K. Chung, R. L. Graham, Erddés on Graphs: His Legacy of Unsolved 
Problems, A K Peters, Cambridge, MA 1993, xiv+142 pp. 
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Summary. Associate to a finite directed graph G(V,£) its out-degree resp. 
in-degree sequences dt, d~ and the corresponding neighborhood lists N*, N7 
(when G is a labeled graph). We discuss various problems when sequences resp. 
lists of sets can be realized as degree sequences resp. neighborhood lists of a directed 
graph. 


1. Introduction 


Consider a finite graph G(V, £). Let us associate with G a finite list P(G) of 
parameters, e.g. the degrees, the list of cliques, the chromatic polynomial, or 
whatever we like. For any set P of invariants there arise two natural problems: 


(R) Realizability. Given P, when is P = P(G) for some graph G? We 
then call P graphic, and say that G realizes P. 

(U) Uniqueness. Suppose P(G) = P(H). When does this imply G = H? 
In other words, when is P a complete set of invariants? 


The best studied questions in this context are probably the reconstruction 
conjecture for (U), and the degree realization problem for (R). This latter 
problem was solved in a famous theorem of Erdés-Gallai [4] characterizing 
graphic sequences. Their theorem reads as follows: Let dj >--- > dn > 0 be 
a sequence of integers. Then (d; > --- > d,) can be realized as the degree 
sequence of a graph if and only if the degree sum is even and 


k n 
Sidi <k(k—-1)+ S© min(dj,k) (k=1,...,n). (1) 
i=1 j=k+1 

A variant of this problem concerning neighborhoods was first raised by Sés 

[13] and studied by Aigner-Triesch [2]. Consider a finite labeled graph G(V, E) 

and denote by N(u) the neighborhood of u € V. N(G) = {N(u): ue V} is 

called the neighborhood list of G. Given a list (multiset) M = (Mj,..., Nn) 
of sets. When is V = N(G) for some graph G? In contrast to the polynomial 
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verification of (1), it was shown in [2] that the neighborhood list problem 
NL is NP-complete for arbitrary graphs. For bipartite graphs, NL turns out 
to be polynomially equivalent to the GRAPH ISOMORPHISM problem. A 
general survey of these questions appears in [3]. 

In the present paper we consider directed graphs G(V, FE) on n vertices 
with or without loops and discuss the corresponding realizability problems 
for the degree resp. neighborhood sequences. We assume throughout that 
there is at most one directed edge (u,v) for any u,v € V. To every u € V 
we associate its out-neighborhood N*(u) = {v € V: (u,v) € E} and its 
in-neighborhood N~(u) = {v € V: (v,u) € E} with dt(u) = |Nt(u)| and 
d~(u) = |N~(u)| being the out-degree resp. in-degree of u. 

For both the degree realization problem and the neighborhood problem 
we have three versions in the directed case: 

(D+) Given a sequence dt = (df ,...,d;+) of non-negative integers. When 
is dt realizable as the out-degree sequence of a directed graph? 

Obviously, (D~) is the same problem. 

(Dt) Given a sequence of pairs dt = ((dj,d7),...,(djt,d;,)). When is 
there a graph G with d*(u;) = d}, d~(u;) = dj for all i? 

(Dt+,D-) Given two sequences dt = (dj,...d%), d~ = (dj,...,d;). 
When is there a directed graph such that d* is the out-degree sequence (in 
some order) and d~ the in-degree sequence? 

In an analogous way, we may consider the realization problems (V+), 
(N+), (N+,N~) for neighborhood lists. 

In Sect. 2 we consider the degree problems and in Sect. 3 the neighborhood 
problems. Section 4 is devoted to simple directed graphs when there is at most 
one edge between any two vertices and no loops. 


2. Degree Sequences 


Depending on whether we allow loops or not there are six different reconstruc- 
tion problems whose solutions are summarized in the following diagram: 


Degrees (Dt) (Dt) (Pe) 


With loops Trivial Gale-Ryser Gale-Ryser 
Without loops | Trivial Fulkerson Fulkerson 


The problems (D*) have the following trivial solutions: (d*) is realizable 
with loops if and only if a <n for all i, and without loops if and only if 
d? <n-—1 for all i. 

Consider (Dt) with loops. We represent G(V, £) as usual by its adjacency 
matrix M where the rows and columns are indexed by the vertices u,..., Un 
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with mj = 1 if (u,u;) € E and 0 otherwise. To realize a given sequence 
(d*) is therefore equivalent to constructing a 0, 1-matrix with given row-sums 
d} and column-sums d; which is precisely the content of the Gale-Ryser 
Theorem [7,11]. In fact, the Gale-Ryser Theorem applies to the situation 
(D*+,D~) as well by permuting the columns. If we do not allow loops, then 
the realization problem (D7) is settled by an analogous theorem of Fulkerson 
[5,6]. He reduces the problem of constructing a 0,1-matrix with zero trace 
and given row and column sums to a network flow problem, an approach 
which can also be used in the case of the Gale-Ryser theorem thus showing 
that both problems are polynomially decidable. Finally, we remark that the 
case (D+,D~) can be reduced to the case (D*) in view of the following 
proposition. 

Proposition 1. Suppose two sequences d+ = (df,...,d¢), d~ = (dj,... 
d~) are given. Denote by d+ (resp. d~) a non-increasing (resp. non- 
decreasing) rearrangement of d* (resp. d~). If there exists a 0,1-matrizx 
M = (mi) with Dj_1mjj = 0, V_amy = di, Vimy = dG, 
1<i<n, then there exists a 0,1-matrix M = (™m;;) satisfying ba m5; = 0, 


_ at oe 
Dy Mig =; , My =d;,, 1 <i<n. 


? 


Proof. Suppose M is given as above. By permuting the rows and columns 
of M by the same permutation (which does not change the trace) we may 


assume that d+ =d’. 


Now suppose that for some indices 1 < j, d; > d; . We will show that M 


can be transformed into a matrix M with zero trace, row sum vector dt and 
column sum vector d~, where d~ arises from d~ by exchanging d; and d; 
Since each Belmutation is generated by transpositions, this will Siviousiy 
complete the proof. To keep notation simple, we give the argument only for 
the casei = n—1, 7 = n but it is immediately clear how the general case 
works. Suppose 


a|b 


y 
x\0 


(i) If x < y, then we exchange (a;,b;) when a; = 1 and b; = 0 for d,_, —d;, 
indices i <n — 2. 


(ii) If z = 1, y = 0, then since d*_, > d} there exists some £ < n — 1 such 


that 
Mn—1,£ = 1 
Mn b - 0 
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Now let 
lola 
es ae IO 
1 |0/0 


3. Neighborhood Lists 


As in the undirected case the neighborhood problems are more involved. 
Again, we summarize our findings in a diagram and then discuss the proofs. 


Neighbors (NT) (N*) (N*,N7) 
With loops Trivial > GRAPH ISOM. # GRAPH ISOM. 
Without loops | Bipartite matching NP-complete NP-complete 


Let us consider (N+) first. Allowing loops, any list (N;*) can be realized. 
In the absence of loops, (N;*) can be realized if and only if (Nj ,...,. N°) 
has a transversal, where N° is the complement of N. So, this problem is 
equivalent to the bipartite matching problem and, in particular, polynomially 
decidable. 

Let us treat next the problem (V+) without loops. The special case N;* = 
N, for all i clearly reduces to the (undirected) neighborhood list problem 
NL which as mentioned is NP-complete. Accordingly, (N+) is NP-complete 
as well. 

We show next that the decision problem (N+) with loops is polynomially 
equivalent to the matrix symmetry problem MS defined as follows: 

The input is an n x n-matrix A with 0, l-entries, with the question: Does 
there exist a permutation matrix P such that (PA)’ = PA holds? 

Let us represent G(V,£) again by its adjacency matrix. Then, clearly, 
MS is the special case of (N+) where N;* = Nj for all i. To see the converse 
denote by z* (resp. y’) the incidence vectors of N;* (resp. N;) as row vectors, 
and set 


The problem (N+) is thus equivalent to the following decision problem: Does 
there exist a permutation matrix P such that 


PX =(PY)"? 
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Note that PX = (PY)? <=> (PX)? = PY. Now consider the 4n x 4n- 
matrix 


O| I |X |U 
I|}O]}I |W 


aaa 


where O, I are the zero-matrix and identity matrix, respectively, and U, W, Z 
are matrices with identical rows each which ensure that a permutation matrix 
R satisfying (RT)? = RI must be of the form 


Clearly, such matrices U, W, Z exist. Now (RI)? = RI if and only if 
(RiX)" = RsY, Ry = Ri, R3 = Ry = Ri, 


i.e. if and only if (Ri X)? = RY, and the result follows. 

It was mentioned in [2] that MS is at least as hard as GRAPH 
ISOMORPHISM, but we do not know whether they are polynomially 
equivalent. 

Let us, finally, turn to the problems (N+, N—). We treat (Vt, ) with 
loops, the other version is settled by a matrix argument as above. Using again 
the notation x, y', X, Y, our problem is equivalent to the following matrix 
problem: Do there exist permutation matrices P and Q such that 


PX =Y7Q, ie. PXQ?T =Y7? 


This latter problem is obviously polynomially equivalent to HYPERGRAPH 
ISOMORPHISM which is known to be equivalent to GRAPH ISOMOR- 
PHISM (see [2]). 


4. Simple Directed Graphs and Tournaments 


Let us now consider simple directed graphs and, in particular, tournaments. 
In contrast to the non-simple case, where (D*) and (N7) are trivial resp. 
polynomially solvable (bipartite matching), the problems now become more 
involved. 

As for (Dt), a necessary and sufficient condition for (d*) to be realizable 
as an out-degree sequence of a tournament was given by Landau [9]. Assume 
dj >--.> dz, then (d}) is realizable if and only if 
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k 
So df <(n-1)+...+(n-k) (K=1,...,n) 
i=1 
with equality for k =n. 
Deleting the last condition yields the corresponding result for arbitrary 
simple digraphs. 


Theorem 1. A sequence (dj > --- > d+) is realizable as out-degree sequence 
of a simple directed graph if and only if 
k 


k 
ai sD e-4) (K =1,...,n). (2) 


i=1 


Proof. The condition is obviously necessary. For the converse we make use of 
the dominance order of sequences p = (pi > ++: > pn), = (qi > +++ > Gn): 


k k 
psqa—=don<s du (k= Lyecasn): (3) 
i=l i=l 


Suppose m = )7'_, d}, and denote by L(m) the lattice of all sequences 
p=(pi >-++> pn), ;_1 pi = M, ordered by (3), see [1] for a survey on the 
uses of L(m). Let S(m) C L(m) be the set of sequences which are realizable 
as out-degree sequences of a simple digraph with m edges. 


Claim 1. S(m) is a down-set, i.e. p€ S(m),q<p=> qe S(m). 


It is well-known that the order < in L(m) is transitively generated by 
successive “pushing down boxes”, i.e. it suffices to prove the claim for g < p 
with q, = pp —1, ds = ps + 1 for some p, > p, +2, and q = p; fori # 1,8. 
Now suppose G realizes the sequence p, with d*(u;) = pj. Since dt (u,) > 
d*(us) + 2, there must be a vertex v; with (u,,ut) € E,(us,ur) ¢ E. If 
(uz,us) € EB, replace (uy, ut), (ue,Us) by (uz, Uy), (us, Ue), and if (uz,us) ¢ E, 
replace (u,, uz) by (ws, uz). In either case, we obtain a simple directed graph 
G’ with q as out-degree sequence, and the claim is proved. 


Claim 2. Supposem =(n—1)+...4+(n-—€4+1)4+r withr<n—-8@, then 
Dm =(n—-1,...,n-£+41,r) is the only maximal element of S(m) in L(m). 


By the definition (3), the sequence 9,,, clearly dominates any sequence in 
S(m), and since p,, can obviously be realized as an out-degree sequence of a 
simple digraph, it is the unique maximum of S(m). 

Taking Claims 1 and 2 together yields the characterization 


dé S(m) => d<D,, 


and this latter condition is plainly equivalent to (1). 
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As is to be expected the neighborhood list problem (N*) is considerably 
more involved. We consider the problem NLSD (neighborhood list of a simple 
digraph): 

Instance: A list N+ = (Nj,...,.Nt). 

Question: Does there exist a simple digraph G(V, £) such that N* is the 
list of its out-neighborhoods? 

In terms of the adjacency matrix, the problem reads as follows: We are 
given an n x n-matrix [ (whose rows are the incidence vectors of the sets 
N;*). Does there exist a permutation matrix R such that RT is the adjacency 
matrix of a simple digraph, i.e. such that 


REAR 2 fT 
where J is the all-ones matrix, and < is to be understood coordinate-wise. 
Theorem 2. The decision problem NLSD is NP-complete. 


For the proof we need an auxiliary result. Consider the decision problem 
k-Aut: 

Instance: A simple undirected graph G. 

Question: Does G admit an automorphism all of whose orbits have length 
at least k? 

The problem 2-Aut is the FIXED-POINT-FREE AUTOMORPHISM 
problem which was shown to be NP-complete by Lubiw [10]. 


Lemma 1. 3-Aut is NP-complete. 
Proof. We give a transformation from 2-Aut. Note first that the construction 


given by Lubiw shows that we may assume an instance G of 2-Aut which 
satisfies the following conditions: 


(a) For u#v € V, the sets N(u) ={w eV: uw € E} and N(v) are distinct. 
(b) Foru Av €eV,N(u)U {u} 4 N(v) U {v}. 


For such graphs G, Sabidussi has shown in [12] that the composition G[G] 
the automorphism group Aut Go Aut G where o denotes the wreath product 
and V(G[G]) =V x V, 


E(G[G)) = {{u1, vr}, (ug, vo} : urug € E or uy = U2, viv2 € E}. 


That is, the composition arises from G by replacing each vertex by an 
isomorphic copy of G and joining either all pairs in different copies or none 
according to whether the original vertices are adjacent or not. The wreath 
product Aut Go Aut G is defined as follows: 


Aut Go AutG = {Ohta <a) : 2 Wi € Aut G} 
where the action on V x V is given by 


(93 1,---,Un)(u,v) = (y(u), Wu(v)) with V = {1,...,n}. 
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We claim that G has a fixed-point free automorphism if and only if G[G] 
admits an automorphism with all orbit lengths at least 3. This will then prove 
our Lemma. 

Suppose first that there exists y € AutG, y(u) 4 wu for all u € V, and 
let O1,...,O% be the orbits of y on V. Suppose |O,| = ... = |O;,| = 2 and 
|O;| > 3 forh+1<i<k. Define p = (y;y1,...,%n) € Aut Go Aut G such 
that w, € {id, y} for all w and vu, = y for exactly one u in each orbit O;, 
1<i<h. Let (u,v) € V x V and denote by O its p-orbit. If the orbit O; of 
u under vy has at least three elements, then the same obviously holds for O. 
In case O; = {u, w}, suppose 7, = y. Then {(u, v), (w, v(v)), (u, e(v))} CO 
and thus |O| > 3. The case = = y is analogous. 

Now assume, conversely, that each y € Aut G has a fixed point u = uy. 
Then each p = (y;¥1,.--,Wn) has a fixed point as well, namely the pair 
(Weis Urdu): Since the transformation G —>+ G/G] is clearly polynomial, the 
proof of the Lemma is complete. 


Remark 1. By iterating the construction of the Lemma, it can be shown 
that k-Aut is NP-complete for every fired k > 2. 


Proof of Theorem 2. We provide a transformation from 3-Aut. Suppose G is 
a simple undirected graph with incidence matrix A € {0,1}"*™, where m is 
the number of edges. We may assume n > 4, m > 3. We denote by t,. the 
r X s-matrix with all entries equal to t, and set A = lnym — A. 

Now consider the (n +m +2) x (n+m + 2)-matrix 


I, A lnxt Inx1 | 


linx1 


Our theorem will be proved by the following claim: There exists a 
permutation matrix R with RI + (RI)? < J —I if and only if G admits 
an automorphism with all orbit lengths at least 3. 

Note first that row n+m-+1 is the only row with at least n+-m zeros. Since 
the (n +m + 1)-st column contains n + m ones, R must fix rown+m-+ 1. 
But then row n + m+ 2 must also be fixed, since the entry in position 
(n+m-+2,n+m-+1)in RT must be 0. Furthermore, the rows with numbers 
{1,...,n} and {n+1,...,2 +m} are permuted among themselves. Hence, 
R is of the form 


where P and Q are permutation matrices of order n and m, respectively. Now 
we have 
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P+P?<J-—-TI and 


RECT) 27 T = 
EL TI aa NT sis 


It is easily seen, that P+ P? < J —I holds if and only if the permutation 7 
corresponding to P has no cycle of length two or less. For the second condition 
we note 


PALAQ™ =PAt Chee AQ? < lax 


if and only if PA < AQ? if and only if PA = AQ’. 
Denoting by Perm, the set of permutation matrices of order n, we have 


AutG ={S € Perm, : JU € Perm, with SA = AU7}, 


and the result follows. 


The corresponding decision problem (N*) for tournaments is open, but 
we surmise that it is also NP-complete. 
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Summary. A bipartite covering of order k of the complete graph K,, on n vertices 
is a collection of complete bipartite graphs so that every edge of Ky, lies in at 
least 1 and at most k of them. It is shown that the minimum possible number 
of subgraphs in such a collection is O(kn/ *). This extends a result of Graham 
and Pollak, answers a question of Felzenbaum and Perles, and has some geometric 
consequences. The proofs combine combinatorial techniques with some simple linear 
algebraic tools. 


1. Introduction 


Paul Erdos taught us that various extremal problems in Combinatorial 
Geometry are best studied by formulating them as problems in Graph Theory. 
The celebrated Erdés-de Bruijn theorem [3] that asserts that n non-collinear 
points in the plane determine at least n distinct lines is one of the early 
examples of this phenomenon. An even earlier example appears in [4] and 
many additional ones can be found in the surveys [5] and [12]. In the present 
note we consider another example of an extremal geometric problem which 
is closely related to a graph theoretic one. Following the Erdés tradition 
we study the graph theoretic problem in order to deduce the geometric 
consequences. 

A finite family C of d-dimensional convex polytopes is called k-neighborly 
if d—k < dim(C NC’) < d—1 for every two distinct members C’ and C” 
of the family. In particular, a 1-neighborly family is simply called neighborly. 
In this case the dimension of the intersection of each two distinct members 
of the family is precisely d — 1. Neighborly families have been studied by 
various researchers, see, e.g., [10, 14, 15, 16, 17]. In particular it is known 
that the maximum possible cardinality of a neighborly family of d-simplices 
is at least 24 [16] and at most 2¢*++ [14]. The maximum possible cardinality of 
a neighborly family of standard bores in R®, that is, a neighborly family of d- 
dimensional boxes with edges parallel to the coordinate axes, is precisely 
d+ 1. This has been proved by Zaks [17], by reducing the problem to 
a theorem of Graham and Pollak [8] about bipartite decompositions of 
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complete graphs. In the present note we consider the more general problem 
of k-neighborly families of standard boxes. The following result determines 
the asymptotic behaviour of the maximum possible cardinality of such a 
family. 


Theorem 1. For 1 < k < d, let n(k,d) denote the maximum possible 
cardinality of a k-neighborly family of standard boxes in R¢. Then 


k-1 ko 
(i) ($)° < TE (42) +1) <nled) < y 21) < 2788)" 


i= 

This answers a question of Felzenbaum and Perles [6], who asked if for 
fixed k, n(k, d) is a nonlinear function of d. 

As in the special case k = 1, the function n(k,d) can be formulated in 
terms of bipartite coverings of complete graphs. A bipartite covering of a 
graph G is a family of complete bipartite subgraphs of G so that every edge 
of G belongs to at least one such subgraph. The covering is of order k if 
every edge lies in at most k such subgraphs. The size of the covering is the 
number of bipartite subgraphs in it. The following simple statement provides 
an equivalent formulation of the function n(k, d). 


Proposition 1. For 1<k <d,n(k,d) is precisely the maximum number of 
vertices of a complete graph that admits a bipartite covering of order k and 
size d. 


The rest of this note is organized as follows In Sect.2 we present the 
simple proof of Proposition 1. The main result, Theorem 1, is proved in 
Sect. 3. Section 4 contains some possible extensions and open problems. 


2. Neighborly Families and Bipartite Coverings 


There is a simple one-to-one correspondence between k-neighborly families 
of n standard boxes in R¢ and bipartite coverings of order k and size d of 
the complete graph K,,. To see this correspondence, consider a k-neighborly 
family C = {Ci,...,C,} of n standard boxes in R¢. Since any two boxes 
have a nonempty intersection, there is a point in the intersection of all the 
boxes (by the trivial, one dimensional case of Helly’s Theorem, say). By 
shifting the boxes we may assume that this point is the origin O. If n > 2, 
O must lie in the boundary of each box, since it belongs to all boxes, and 
the dimension of the intersection of each pair of boxes is strictly smaller than 
d. Put V = {1,2,...,n}. For each coordinate x;, 1 < i < d, let H; be the 
complete bipartite graph on V whose sets of vertices are Aa = {j: C; is 
contained in the half space 2; > 0}, and V; = {j : C; is contained in the 
half space x; < O}. It is not difficult to see that if the dimension of C, M Cy 
is d—r, then the edge pq of the complete graph on V lies in exactly r of the 
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subgraphs H;. Therefore, the graphs H; form a bipartite covering of order k 
and size d. 

Moreover, the above correspondence is invertible; given a bipartite 
covering of the complete graph on V = {1,2,...,n} by complete bipartite 
subgraphs H,,...,H, one can define a family of n standard boxes as follows. 
Let Ve and V,; denote the two color classes of H;. For each 7, 1 <j <n, 
let C; be the box defined by the intersection of the unit cube [—1,1]¢ with 
the half spaces x; > 0 for all i for which j € a and the half spaces x7; < 0 
for all 2 for which j € V,~. If the given covering is of order k, the family of 
standard boxes obtained is k-neighborly. 

The correspondence above clearly implies the assertion of Proposition 1, 
and enables us to study, in the next section, bipartite coverings, in order to 
prove Theorem 1. 


3. Economical Bipartite Coverings 


In this section we prove Theorem 1. In view of Proposition 1 we prove it 
for the function n(k,d) that denotes the maximum number of vertices of a 
complete graph that admits a bipartite covering of order k and size d. 

Part (i) of the theorem is essentially known. The fact that n(1,d) =d+1 
is a Theorem of Graham and Pollak [8, 9]. See also [7, 11, 18, 13, 1] and [2] 
for various simple proofs and extensions. The statement that for every fixed 
d, n(k,d) is a non-decreasing function of k is obvious and the claim that 
n(d,d) = 2% is very simple. Indeed, the chromatic number of any graph 
that can be covered by d bipartite subgraphs is at most 2%, implying that 
n(d,d) < 24. To see the lower bound, let V be a set of 2¢ vertices denoted by 
all the binary vectors € = (€1,...,€a), and let H; be the complete bipartite 
graph whose classes of vertices are all the vertices labelled by vectors with 
€; = 0 and all the vertices labelled by vectors with e; = 0. Trivially H,,..., Ha 
form a bipartite covering (of order d and size d) of the complete graph on V, 
showing that n(d,d) = 2%, as claimed. 

The lower bound in part (ii) of the theorem is proved by a construction, 
as follows. For each i, 0 < i < k—1, define dj = |(d+i%)/k|] and D; = 
{1,2,...,di,d; +1}. Observe that pen d, = d. Let V denote the set of 
vectors of length k defined as follows 


VY S49 (egyttyie. ep) cape. Det. 


For each r, 0 < r < k—1, and each j, 1 < j < d,, let H,,; denote the 
complete bipartite graph on the classes of vertices 


Ary = { (€0,€1,---,€h-1) :€ =J 
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and 
Bag = £ (@y€1y---s6e-t) 16 2441} 


Altogether there are | d, = d bipartite subgraphs H,.;. It is not too 
difficult to see that they form a bipartite covering of the complete graph on 
V. In fact, if (€9,...,€%-1) and (€G,...,€,_1) are two distinct members of V, 
and they differ in s coordinates, then the edge joining them lies in precisely 
s of the bipartite graphs. Since 1 < s < k for each such two members, the 
above covering is of order k, implying the lower bound in part (ii) of the 
theorem. 

The upper bound in part (ii) is proved by a simple algebraic argument. 
Let H,,...,Hq be a bipartite covering of order & and size d of the complete 
graph on the set of vertices N = {1,2,...,n}. Let A; and B; denote 
the two vertex classes of H;. For each 1 € N, define a polynomial P; = 
Pi(a1,..-,2d;Y1,---, Ya) as follows: 


For each i € N let e; = (bi1,..., bia, @i1,---,@ia) be the zero-one vector in 
which aj, = 1 if i € A, (and aj, = 0 otherwise), and, similarly, bj, = 1 if 
i € By (and bi, = 0 otherwise). The crucial point is the fact that 


Pi(e;) = 0 for alli Aj and P,(e;) £0. (1) 


This holds as the value of the sum 


Set 


p:i€Ap qicBg 


for Lp» = bjp and Yq = Gjq is precisely the number of bipartite subgraphs in 
our collection in which i and j lie in distinct color classes. This number is 0 
for 1 = j and is between 1 and k for all i 4 j, implying the validity of (1). 
Let P; = Pi(x1,...,@a,Y1,---,Ya) be the multilinear polynomial ob- 
tained from the standard representation of P; as a sum of monomials by 
replacing each monomial of the form c]],<¢ 72° [Tier y)’, where all the 0, 
and 7; are positive, by the monomial c][,¢¢ %s [],er yz. Observe that when 


all the variables x,, yq attain 0, 1-values, P;(#1,..., ya) = Pi(x1,...,ya) since 
for any positive 6, 0° = 0 and 1° = 1. Therefore, by (1), 
P;(e;) = 0 for alli xj and P;(e;) x 0. (2) 


By the above equation, the polynomials P; (i € N) are linearly independent. 
To see this, suppose this is false, and let 


S 7 cP; (a1, ---5 ya) => 0, 


iE N 
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be a nontrivial linear dependence between them. Then there is an i’ € N so 
that cy 4 0. By substituting (21,..., ya) = ei we conclude, by (2) that cy = 
0, contradiction. Thus these polynomials are indeed linearly independent. 
Each polynomial P; is a multilinear polynomial of degree at most k. Moreover, 
by their definition they do not contain any monomials that contain both x; 
and y; for the same i. It thus follows that all the polynomials P; are in the 
space generated by all the monomials [[,-5 %s [],er ys, where S and T range 
over all subsets of N satisfying |S| + |T| < k and SNT = 9. Since there are 
m= = ay (“) such pairs S, 7’, this is the dimension of the space considered, 
and as the polynomials P; are n linearly independent members of this space 
it follows that n < m. This completes the proof of part (ii) and hence the 
proof of Theorem 1. 


4. Concluding Remarks and Open Problems 


The proof of the upper bound for the function n(k, d) described above can be 
easily extended to the following more general problem. Let K be an arbitrary 
subset of cardinality k of the set {1,2,...,d}. A bipartite covering Hi,..., Ha 
of size d of the complete graph K,, on n vertices is called a covering of type 
K if for every edge e of K,,, the number of subgraphs H; that contain e is a 
member of kK. The proof described above can be easily modified to show that 
the maximum n for which K,, admits a bipartite covering of type K and size 
d, where |K| = k, is at most oe FC There are several examples of sets 
for which one can give a bigger lower bound than the one given in Theorem 1 
for the special case of K = {1,...,k}. For example, for K = {2,4}, there is a 
bipartite covering H,,...,Hq of type K of a complete graph on n = 1+ ey) 
vertices. To see this, denote the vertices by all subsets of cardinality 0 or 2 of 
a fixed set D of d elements and define, for each i € D, a complete bipartite 
graph whose classes of vertices are all subsets that contain 7 and all subsets 
that do not contain 7. Similar examples exist for types K of bigger cardinality. 

One can consider bipartite coverings of prescribed type of other graphs 
besides the complete graph, and the algebraic approach described above can 
be used to supply lower bounds for the minimum possible number of bipartite 
subgraphs in such a cover, as a function of the rank of the adjacency matrix 
of the graph (and the type). 

The main problem that remains open is, of course, that of determining 
precisely the function n(k,d) for all k and d. Even the precise determination 
of n(2,d) seems difficult. 
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Summary. Two parties are said to “share a secret” if there is a question to which 
only they know the answer. Since possession of a shared secret allows them to 
communicate a bit between them over an open channel without revealing the value 
of the bit, shared secrets are fundamental in cryptology. 

We consider below the problem of when two parties with shared knowledge can 
use that knowledge to establish, over an open channel, a shared secret. There are 
no issues of complexity or probability; the parties are not assumed to be limited in 
computing power, and secrecy is judged only relative to certainty, not probability. 
In this context the issues become purely combinatorial and in fact lead to some 
curious results in graph theory. 

Applications are indicated in the game of bridge, and for a problem involving 
two sheriffs, eight suspects and a lynch mob. 


1. Introduction 


Suppose two parties—let us call them “Alice” and “Bob”—share a secret, 
that is, they have common knowledge possessed by no one else; then Alice 
may use her secret to transmit a bit to Bob in such a way that no eavesdropper 
can deduce the value of the bit. For example, if Alice and Bob are the only 
people in the world who know whether the current US President wears a wig, 
Alice may send Bob the following message (or the same message, with 0 and 
1 interchanged): 


My bit is 0 if the President wears a wig, 1 otherwise. 


While Eve (an eavesdropper) may believe that the President probably 
does not wear a wig, and therefore that Alice’s bit is more likely to be 1 than 
0, her inability to determine the value of the bit with certainty is all that 
concerns us here. 

This method of encryption is called a “one-time pad”; Alice and Bob 
share a bit of information, and they can use it (once) to pass a bit in secret. 
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There are, however, situations where even though Alice and Bob appear 
to possess shared information not available to the public, this information 
does not take the form of a shared secret. Nonetheless, Alice and Bob may 
be able to isolate a shared secret by communicating with each other, even 
though their messages are public. Since this is precisely the situation where 
cryptologic methods are needed (communication lines are available, but not 
private), Alice and Bob are almost as well off here as if they had begun 
with a shared secret; they must merely spend a few preliminary rounds of 
communication in establishing the secret. 

Let us give two examples of such situations, before proceeding further. 


(1) The game of bridge: Here two partners wish to communicate in 
private, but the rules of the game require that all communication be 
done by legal bids and plays, about which there may be no prior private 
understandings. Thus, there are initially no shared secrets. But there is 
private information: each player knows, by virtue of looking at his own 
hand, 13 cards that do not belong to his partner. Can they make use of 
this information to communicate in private? 

(2) The ‘two sheriffs’ problem: Two sheriffs in neighboring towns are 
on the track of a killer, in a case involving eight suspects. By virtue of 
independent, reliable detective work, each has narrowed the list to only 
two. Now they are engaged in a telephone call; their object is to compare 
information, and if their pairs overlap in just one suspect, to identify him 
(the killer) and put out a.p.b.’s so as to catch him in either town. 

The difficulty is that their telephone line has been tapped by the 
local lynch mob, who know the original list of suspects but not which 
pairs the sheriffs have arrived at. If they are able to identify the killer 
with certainty as a result of the phone call, he will be lynched before he 
can be arrested. 

Can the sheriffs accomplish their objective without tipping off the 
mob? 


2. The Mathematical Model 


One natural model for common knowledge is obtained by imagining that in 
any situation there is an underlying finite space S of possibilities of which 
any one element may be “the truth.” Alice’s knowledge concerning S' at any 
point consists of some subset X of S, meaning that X is precisely the set of 
truths consistent with what Alice knows. As Alice communicates with Bob 
she obtains more information, and her knowledge set X shrinks accordingly. 

At any time the “true point” must lie both in X and in Bob’s knowledge 
set Y, but if there are two or more points in X MY Alice and Bob will never 
be able to choose among them by communicating with each other. Hence for 
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our purposes if X is a possible knowledge set for Alice and Y for Bob, our 
only concern is whether they intersect. 

Consequently we choose to model common knowledge by using a mere 
vertex to represent a possible knowledge set of Alice’s, and similarly for Bob; 
we connect vertex « of Alice’s with vertex y of Bob’s when the corresponding 
knowledge sets intersect, that is, when the two vertices are simultaneously 
possible. The “truth” is thus represented by some adjacent pair of vertices, 
i.e. an edge. 

Alice and Bob’s knowledge at any time thus constitutes a graph, which, in 
accordance with cryptographic tradition, is assumed to be known to everyone 
in the world. The interpretation of this graph will, we hope, become clear to 
the reader after some examples. 

It is convenient to formalize our model as follows. 


Definition 1. A bigraph is a finite, non-empty collection H of ordered pairs 
such that if (x,y) is in H then (y, z) is not. 


Elements of the set A(H) := {x : (x,y) € H for some y} will be termed 
“Alice’s vertices” and are perforce distinct from the symmetrically defined 
“Bob’s vertices” in B(H). Thus the elements of H are edges of a bipartite 
graph, but note that the vertices come equipped with a labelled left-right 
(Alice-Bob) partition and that isolated vertices cannot arise. 

Our model now consists of a bigraph H, known to all, the edges of which 
represent possible truths. Alice knows the endpoint in A(#) of the true edge, 
Bob its endpoint in B(H); in other words, if the true edge is (a, y) then Alice 
knows x and Bob knows y. 

We say that Alice and Bob share a secret if there is a question to which 
they know the answer and Eve does not. In the wig example, the question 
“Does the President wear a wig?” can be answered only by Bob and Alice, 
so they indeed share a secret in this case. Here, the bigraph H consists of a 
pair of disjoint edges, one corresponding to “the President wears a wig” and 
the other to “the President does not wear a wig.” The disconnectivity of H 
is its crucial property: 


Theorem 1. Two parties share a secret if and only if their bigraph is 
disconnected. 


Proof. It is immediate that Alice and Bob share a secret whenever their 
bigraph is disconnected, since if C' is one of its connected components, only 
they can answer the question “Is the true edge in C' ?”. To see the converse, 
let Q be the given question and let a1,a2,... be its possible answers (from 
Eve’s point of view). Write “(u, v)#a,” if it is simultaneously possible for a; 
to be the answer to Q, and (u,v) to be the true edge of H. 

We now note that if (x, y)#a; and (u,v)#a, for i # j, then (x,y) and 
(u,v) can be neither identical nor adjacent; if, for example, x = u then when 
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Alice’s end of the true edge is x she will be unable to decide between answers 
a, and a;. 

It follows that the edges consistent with the various answers a; determine 
a partition of H, each part of which is a non-empty union of connected 
components; since the number of possible answers must be at least two, H is 


disconnected. 


If H is connected, are Alice and Bob doomed never to share a secret? Of 
course, if they can arrange a private (secure) conversation, they can agree on 
some string of bits and thus share as many secrets as they wish; this indeed 
is often done in traditional cryptography, in the name of agreeing on or 
distributing key. Unfortunately this phase is often dangerous and sometimes 
impossible; else, cryptography would be unnecessary. However, Alice and Bob 
may be able to use their common knowledge (reflected in the structure of H) 
to isolate a common secret by means of a public conversation; and it is just 
this process which we wish to investigate. 

Consider, for example, the bigraph H = {(a1,61), (1,62), (ae, be), 
(a2, b3), (a3, b3), (a3, 64), (a4, ba), (a4, b1)}. H is an 8-cycle, thus connected, 
but Alice and Bob can disconnect it as follows: if Alice’s end of the true edge is 
a, or a3 she says so: “My end of the true edge is either a; or a3.” Bob can tell 
by looking at his end which of the two possibilities is the case, hence they now 
share a secret; this is reflected in the fact that after Alice’s announcement, 
their bigraph is disconnected (see Fig. 1). Of course, had Alice’s end been 
G2 OF @4, an announcement to that effect would also have done the job. (We 
shall see later that two-sided conversations may be necessary to disconnect 


some bigraphs.) 
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Fig. 1 Alice separates an 8-cycle. 
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A A 
K K 
Q Q 


Fig. 2 Bigraph for dealing 2 cards from a 3-card deck. 


The next example is inspired by the work of Winkler [4, 5] on cryptologic 
techniques for the game of bridge, and the recent works of Fischer, Paterson, 
and Rackoff [2] and Fischer and Wright [3] using random deals of cards for 
cryptographic key. A card is dealt at random, face down, to each of Alice and 
Bob, from a three-card deck consisting (say) of an Ace, a King, and a Queen. 
The remaining card is discarded unseen. Here H is a 6-cycle (see Fig. 2). The 
six edges correspond to the six possible deals; the set A(#) consists of Alice’s 
three possible holdings (“A”, “K” or “Q”) and similarly for B(H). The fact 
that “K” in A(H) and “K” in B(#) are not adjacent corresponds to the fact 
that Alice and Bob cannot both hold the King. Clearly Alice and Bob share 
some knowledge in this situation, but as we shall see it is not enough for 
them to be assured of being able to isolate a shared secret. 

For a third example, suppose H is a 10-cycle on vertices vg, v1,..., U9 
with v; adjacent to v; just when |i — j| = 1 mod 10. (See Fig. 3.) 

Let A(#) consist of the vertices of even index. Then the following protocol 
allows Alice to disconnect the bigraph: if she holds v;, she chooses 7 to be 
either i+ 4 ori +6 mod 10, then tells Bob: 


I hold either v; or v;. 


This protocol is said to be non-deterministic because Alice has more 
than one choice of message for a given holding. Non-determinism as used 
here is thus quite different from its use in complexity theory, and in fact is 
more closely related in some respects to randomization (despite the absence 
of probability in our model). In particular non-deterministic communication 
protocols are quite practical. 

Incidentally, we assume nothing is given away by the order in which 
objects are named in a semantically symmetric expression such as “v; or v;.” 
In a deterministic protocol we can insure this by a naming convention, such as 
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Fig. 3 Non-deterministically separating a 10-cycle. 


enforced alphabetical order, but our mathematical model for communication 
will obviate the problem. 

In order to define “communication protocol” rigorously we need to 
define “conversation”, even though the latter definition needs the former for 
interpretation. Accordingly, a conversation will be a finite string m1,...,mz 
of positive integers, communicated alternately by Alice and Bob beginning 
with Alice. 

Although this seems perhaps a limited vocabulary for communication, it is 
in fact completely general because meanings can be assigned to the numbers 
via an agreed-upon protocol. The communication protocol specifies under 
precisely what circumstances a given number may be uttered. Thus, we operate 
in effect in the “political” theory of communication: when someone says 
something we ask not “what does that mean” but “under what circumstances 
would he/she have said that.” This model is both stronger and simpler than 
one in which messages are sentences. To see its effect, consider solving the 
Cio bigraph above by using always vertices 7 and i+ 4. This looks reasonable 
at first glance but is actually a sham. For example, Alice would not then say 
“My end is 4 or 8” if she held 8. Hence an eavesdropper can eliminate 8 if she 
hears this, preventing disconnection of the bigraph and ruining the protocol. 


3. Deterministic Separation 


Definition 1. A deterministic communication protocol for the bigraph H 
is a sequence of functions f,,... into the positive integers, such that f, is a 
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function of x (Alice’s end of the true edge), and each subsequent f; depends 
on the values of fi,..., fi-1 and etther on x, if i is odd, or on y (Bob’s end 
of the true edge) if i is even. 


Thus, a deterministic communication protocol, combined with a true 
edge, produces a unique conversation given by m; = fi(@;m1,...,mi—-1) for é 
odd and m; = fi(y;7m1,...,™m-1) for i even. After step i of the conversation, 
the situation can be again described by a bigraph H* consisting of those 
edges (u,v) such that if (u,v) had been the true edge the conversation would 
have been as seen. H* is a sub-bigraph of H which contains the actual 
true edge; in fact H* is obtained from H‘~! by deleting all edges incident 
to certain left-hand vertices (for 7 odd) or certain right-hand vertices (for 
i even). The vertices which survive on the left (when i is odd) are just 
{u: fi(ujmi,...,mi-1) = mi}. 

For our purposes a conversation (7m ,..., mz) will be deemed “successful” 
if H* is disconnected, and a communication protocol for H will be said to 
“separate H” or to be a “separation protocol for H” if it always produces a 
successful conversation. Finally, H itself will be termed deterministically sep- 
arable if there is a deterministic communication protocol which separates H. 

We can also give a recursive characterization of the deterministically 
separable bigraphs. It will be useful to introduce notation for the sub-bigraph 
HA’ of a bigraph H induced by a subset A’ of A(H); namely, 


H|A' := HA (A' x B(H)). 


Thus from the standpoint of ordinary graph theory, H|A’ is obtained by 
discarding isolated vertices from the subgraph of H induced by A’ U B(#). 
The definition of H|B’ for B’ C B(#) is similar. 


Theorem 2. Let DS be the smallest symmetric class of bigraphs which 
contains the disconnected bigraphs and has the following property: for any 
bigraph H, if there is a partition A(H) = A, U Ag of Alice’s vertices such 
that H|A, and H|Ag2 are both in DS, then H is also in DS. Then DS is the 
class of deterministically separable bigraphs. 


Proof. Let us check first that the class of deterministically separable bigraphs 
is indeed closed under the operation defined in the statement of the theorem. 
It is certainly symmetrical, since a protocol which separates H can be 
modified to one which separates the dual of H (i.e. the result of reversing 
the ordered pairs in H) by switching the roles of Alice and Bob, and adding 
a meaningless message from Alice to the beginning. If a partition is given 
along with separation protocols P, and P 2 for the two sub-bigraphs of H, we 
design a separation protocol P for H as follows: first Alice sends “i” if and 
only if her end of the true edge lies in A;, then Bob sends back a meaningless 
“1”, then protocol P; is followed. 

It remains to show that any symmetric class C containing the dis- 
connected bigraphs and closed under the stated operation contains all 
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deterministically separable bigraphs. This is done by induction on the number 
of edges. 

Let H be connected but deterministically separable via a communication 
protocol P; then sooner or later P must call for a first meaningful message, 
say from Alice. We may assume that her options at that point are to send “1”, 
“2” etc. up to “k” for some k > 2, according to whether her end of the true 
edge lies in E,, Eo, etc; there is no dependence on previous conversation here 
because by assumption said conversation has up to now been predictable. Let 
us modify P slightly by having Alice send only a 1 or 2 at this point, the 
former just when her end is in £1; if she sends a “2” Bob sends a meaningless 
“1” back, then Alice sends the “2”, “3”,...or “k” that would have been sent 
before and the protocol P resumes. We have thus found protocols which 
separate each of H|E, and H|(A(H)— F,). Each of these is thus in C by the 
induction assumption, hence H € C by the closure condition. 


4. Non-deterministic Separation 


We now introduce two ways of weakening the definition of a communication 
protocol. First, if the functions f; are permitted to be multi-valued, so that 
at each point Alice or Bob has one or more possible messages to send, we say 
that the protocol is non-deterministic. Note that during a non-deterministic 
communication protocol, the bigraph shrinks as before but this time each 
party, at his or her turn to speak, is provided with a (message-labelled) cover 
of his or her vertices instead of a partition. It is still the case, however, that 
the knowledge of Alice and Bob is expressed at each point by the state of 
their bigraph. 

Second, if the functions are permitted to depend for odd 7 on a random 
number m known only to Alice, and for even i on a random number n known 
only to Bob, then the protocol is randomized. Here the bigraph does not any 
longer describe the situation completely, as Bob and Alice may learn things 
about each other’s random number. 

Whether a communication protocol is non-deterministic, randomized or 
both, however, we continue to insist that the protocol always produce a 
successful conversation in order to qualify for separating H. 

Fortunately, the three new categories of separation protocol which arise 
result in only one new category of bigraph. 


Theorem 3. The following are equivalent, for any bigraph H: 


(a) H is separable by a non-deterministic communication protocol; 

(b) H is separable by a randomized communication protocol; 

(c) H is separable by a randomized, non-deterministic communication pro- 
tocol. 


On the Isolation of a Common Secret 29 


Proof. We need to show (c)—>(b) and (c)—>(a), the reverse implications being 
trivial. Of these the former is easy: by extending the range of the random 
numbers, Alice and Bob can use them to decide which message to send when 
there is more than one choice. 

Turning random numbers into non-deterministic choices looks awkward 
because a random number may be used many times in the protocol, whereas 
there is no “consistency” built in to nondeterminism. However, this problem 
is illusory. Suppose, at Alice’s first turn to speak, that she is supplied with a 
randomized separation protocol but no random number; then she chooses a 
random number and acts accordingly. She cannot “remember” that number 
at her next turn and use it again, but she can compute which random numbers 
are consistent with her previous action and choose one of those upon which 
to base her next message. Bob behaves similarly; at each turn he determines 
which values of his non-existent random number are consistent with his own 
previous actions (and his end of the true edge), then picks one such value 
and acts accordingly. 


A bigraph will be called, simply, separable if one (thus all) of the 
conditions of Theorem 3 obtains. The following recursive characterization 
is analogous to Theorem 2, although a small additional subtlety arises in the 
proof. 


Theorem 4. Let S be the smallest symmetric class of bigraphs which 
contains the disconnected bigraphs and has the following property: for any 
bigraph H, if there is a covering A(H) = A,U Ag of Alice’s vertices such that 
H|A, and H|A2 are both in S, and both Ay and Ag are strictly contained in 
A(H), then H is also in S. Then S is the class of separable bigraphs. 


Proof. The proof that the class of separable bigraphs is symmetrical and 
closed under the operation defined in the statement of the theorem is as in 
Theorem 2, except that if Alice’s end of the true edge lies in AyM Az she may 
send either message “1” or message “2”. 

It remains to show that any symmetric class C containing the discon- 
nected bigraphs and closed under the stated operation contains all separable 
bigraphs; this is again done by induction on the number of edges. 

Let H be connected but separable via a non-deterministic communication 
protocol P, and suppose that H is the smallest separable bigraph not in C. 
Let us call a vertex u in A(H) (or, dually, in B(H)) weak if there is no proper 
subset A’ of A(H) containing u for which H|A’ is separable. 

We claim that there is some weak vertex in A(H). For, if not, define for 
each x € A(H) a proper subset A, which does yield a separable sub-bigraph 
of H. Since the A,’s cover A(H), we can find x71, x2,..., x, such that the A,,,’s 
cover A(H) with k minimal (but necessarily greater than 1). Set A, := Az,, 
Ag := Az, U:--UAz,. Then A; and A: area proper cover of A(H), and H|A, 
is separable by assumption. However, H| Ag is also separable, since Alice can 
reduce it to a separable bigraph by sending some 7 for which her end of the 
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true edge lies in A,,,2 <i < k. These bigraphs are thus both in C by the 
induction assumption, contradicting the fact that H is not in C. 

Since the class of separable bigraphs is symmetric, the dual of H is also 
separable but not in C; hence the same argument produces a weak vertex 
in B(H). It may seem to the reader that weak vertices cause trouble only if 
found on the true edge, and thus that Alice and Bob can’t both be stymied as 
long as no two weak vertices are adjacent. However, it turns out that the mere 
presence of weak vertices on both sides is enough to render H inseparable. 

To see this, let wu be a weak vertex in A(H); there must be some message 
(say m) which Alice is permitted to send when her end of the true edge is 
u. Let A’ be the set of vertices in A(H) which, like u, allow the message m; 
then H|A’ must be separable, since this is the bigraph which results when m 
is sent. Thus A’ must not be a proper subset of A(H), that is, A’ = A(H) 
and the message m is meaningless. 

If m is indeed sent, the protocol turns to Bob who still has all of H 
before him. By the same reasoning as above, he must also have a meaningless 
message (i.e. a message he can send regardless of which vertex is his end of 
the true edge) available to him. 

Now we’re back to Alice with H still intact. We thus see that Alice and 
Bob must be allowed by the protocol to pass meaningless messages back 
and forth ad infinitum, irrespective of which edge of H is the true edge; but 
then we have a contradiction, since H is required to have a communication 
protocol which always separates. 


Theorem 4 is often useful in determining separability via case analysis. 
For example, it is easy to check that no path with fewer than five edges is 
separable, nor is the 6-cycle (Fig. 2) separable because for any proper subset 
S of Alice’s or Bob’s vertices, H|.S would be a path of length 2 or 4. 

There is one class of bigraphs which is easily seen to be disjoint from the 
class of separable bigraphs, a fact which helps in obtaining negative results. 


Theorem 5. Suppose that there is an edge of H which is adjacent to all 
other edges of H. Then H is not separable. 


Proof. Such an edge cannot be contained in any disconnected sub-bigraph of 
H; thus, if it happens to be the true edge, no protocol can separate H. 


Note that, in particular, no separable bigraph can have a vertex which is 
adjacent to all the vertices on the other side. 

We are now in a position to prove that the class of separable bigraphs is 
strictly larger than the class of deterministically separable bigraphs. 

Let A(H) = {a1,...,a7} and B(H) = {b1,...,67}, and put (a;,b;) € H if 
and only if 7 —7 = 1, 2 or 4, where the indices are interpreted always modulo 
7. Then H is the incidence graph of a Fano plane (see Fig. 4) and we have: 


Theorem 6. The incidence graph of the Fano plane is separable but not 
deterministically separable. 
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Fig. 4 Incidence graph of the Fano plane. 


Proof. We first provide a non-deterministic separation protocol. Alice begins 
by sending a number k such that her end of the true edge is in the set 
{ak, Qk41,4%42}. Bob now (deterministically) sends back “1” if his end is 
bee or by46; “2” if it is byya or bg45; “3” if it is bp43 or be41. (It cannot be 
by.) This separates H into a 2-edge component and a 1-edge component. 

If, on the other hand, there were a deterministic separation protocol for H, 
then one of the parties would eventually have to send a meaningful message, 
thus effecting a partition of his or her vertices. This may as well be Alice 
since H is symmetrical. If one of the parts has fewer than 3 of Alice’s vertices 
in it, or has 3 vertices whose neighborhoods intersect, then one of Bob’s 
remaining points will be of full degree, contradicting Theorem 5. Otherwise 
the partition must be isomorphic to {a 1, a2, a3} versus {a4, a5, a6,a@7}. The 
former part induces a deterministically separable sub-bigraph as we have 
seen from the above protocol, but the sub-bigraph induced by the latter part 
contains a vertex (b,) whose neighborhood intersects the neighborhoods of 
all other vertices of B(H). Thus, if Bob’s end of the true edge is b1, he cannot 
separate the bigraph at this time. However, Alice is also stymied because she 
has only one vertex available (a5) not adjacent to 6;, thus her vertices can 
never be partitioned so as to induce disconnected sub-bigraphs. 


The situation changes if we consider bigraphs which are separable in one 
round, that is, by just one message from Alice. (This is not, by the way, a 
symmetric class; the path on 7 vertices, for example, can be separated only 
by a message from the party with 4 vertices.) Before proceeding, we need a 
curious graph-theoretic result. 
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Theorem 7. Let G be any graph with no vertex adjacent to all others. Then 
there is a partition V,,...,V_ of the vertices of G such that for each i = 
1,...,k the subgraph (V;) induced by V; is disconnected. 


Proof. Tf not let G be a counterexample with smallest possible number of 
vertices. For any subset U of the set of vertices V, let w(U) be the number 
of vertices in V — U which are adjacent to all other vertices in V — U. 

Note first that if (U) is disconnected, then w(U) must be non-zero; else 
we may apply the induction hypothesis to get a suitable partition of V — U, 
and appending U itself to this partition yields a partition of V suitable for G. 

Choices of U for which (U) is disconnected do exist, of course, since U 
can be taken to be a pair of non-adjacent vertices. Hence we may choose a 
U for which (U) is disconnected and w(U) is minimal. 

Now let x be any full vertex in V — U, that is, any vertex in V — U 
which is adjacent to all other vertices in V — U. By assumption x has some 
non-neighbor, say y, in V; let C' be the set of vertices of the component of 
(U) into which y falls. 

Suppose first that y is not the only vertex in C, and let W = U \ {u}. 
Then (W) is still disconnected, but x is no longer full in V — W since a 
non-neighbor y has “moved in.” Of course y is not full either, and any other 
vertex which is full in V — W must already have been full in V — U. Hence 
w(W) < w(U), a contradiction. 

We are reduced to the case where y is an isolated point of (U); now we 
let W = UU {a}. Since x and y are not adjacent y is still isolated in (W). 
Any full vertex of V — W was adjacent to x in V — U and therefore already 
full in V — U; but a itself is now gone from V — W and so we again have 
w(W) < w(U), and this contradiction proves the theorem. 


Note that the induction hypothesis, and thus the theorem itself, can be 
strengthened to read “each (V;) has at least two vertices and contains an 
isolated point” without changing the proof. However, we will not need the 
stronger statement. 


Theorem 8. The following are equivalent for a bigraph H. 


(a) H is separable in one round; 

(b) H is deterministically separable in one round; 

(c) For every vertex u of A(H) there is a verter v of A(H) such that the 
neighborhoods of u and v (in B(H)) are disjoint. 


Proof. It is enough to show (a)—(c)—(b). Suppose that H is separable in 
one round, let uw be any vertex of A(H), and let i be a message that can 
be sent by Alice when her end of the true edge is u. Since the bigraph that 
results from sending “i” is disconnected, there is a vertex v on Alice’s side 
of it which is in a different component from u; then u and v must originally 
have had disjoint neighborhoods. 
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Now suppose that (c) is satisfied and form a graph G on the vertices in 
A(#) by defining {u,v} to be an edge whenever u and v have intersecting 
neighborhoods. Condition (c) says precisely that G has no vertex of full 
degree, hence we may apply Theorem 7 to obtain a partition Vj,...,Vz of 
the vertices of A(H) each part of which induces a disconnected subgraph of 
G, hence also of H. Sending “i” when Alice’s end of the true edge lies in V; 
thus yields a deterministic separation protocol. 


We have said that a disconnected bigraph is sufficient to enable Alice and 
Bob to communicate a bit in secret; we are now in a position to show that 
separability is in fact necessary for such a communication, thus completing 
the reduction of the original cryptologic problem to a graph-theoretical one. 

Let us fix a bigraph H and suppose that Alice (say) has been supplied 
with a bit ¢ which she must communicate in secret to Bob, over our usual 
public channel. The effect of the bit is to double the vertices on Alice’s side 
of H; that is, each vertex a € A(H) now becomes a pair a(0),a(1) each with 
the same neighborhood that a had in B(H). The edge (a(0), b) corresponds 
to the original (a,b) together with the statement “e = 0”. 

At the conclusion of a successful, non-randomized communication proto- 
col the question “What is the value of ¢?” must be answerable by Bob but 
not by Eve, hence the bigraph must now be disconnected—and moreover 
(although we shall not need this fact) the vertices from A(H) in each 
component must either all correspond to ¢ = 0 or all toc = 1. 


Theorem 9. Alice can communicate a bit to Bob in secret, via a randomized 
and/or non-deterministic communication protocol, if and only if their bigraph 
is separable. 


Proof. Sufficiency has already in effect been demonstrated; Bob and Alice can 
cooperatively disconnect the bigraph, ignoring the bit value, then a message 
of the form “My bit is 0 if the true edge lies in component C, 1 otherwise” 
does the trick. 

For the converse, we double the bigraph as above so that the protocol may 
be regarded as a special communication protocol, which we denote by P. If P 
is randomized, then we may replace the random inputs by non-determinism 
as in Theorem 3; thus we may assume P is merely non-deterministic. 

Now we construct from P a randomized (!) communication protocol 
P’ which operates on the original, undoubled bigraph plus a single ran- 
dom bit a for Alice. P’ operates by the rule f/(x;a;m1,...,mi-1) = 
fi(a(a);mi,...,mi-1) for i odd, and f/ = f; for i even. 

At each stage, the bigraph associated with P’ will be precisely the image 
of the bigraph associated with P under the collapsing map ® which sends (0) 
and z(1) to x. But the image of a disconnected bigraph under this mapping 
is again disconnected, so P’ is a separation protocol for the original bigraph, 
completing the proof of the theorem. 
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Theorem 9 says, in effect, that if Alice and Bob know that they will need 
to communicate a bit in secret, then they can disconnect their bigraph in 
advance; when the bit comes in, it can then be communicated (in either 
direction) by a single message. 


5. The ‘Two Sheriffs’ Problem 


Let us now look now at the two sheriffs problem, but generalized as follows: 
one sheriff (whom we shall call “Lew” ) has narrowed his list of suspects to p, 
the other (“Ralph”) to qg, and the total number of suspects is n. The edges 
of the bigraph H here represent all possible pairs (LZ, R) of subsets of the set 
N = {1,2,...,n} of suspects, with |L| = p, |R| = q and |LN R| 4 0. Lew’s 
side A(H) of the bigraph will thus contain (") vertices and Ralph’s side (") 
vertices, adjacency arising when the corresponding subsets intersect. 

If Lew and Ralph succeed in determining the identity of the killer without 
tipping off the mob, they will share a secret and thus must have disconnected 
H. Conversely, suppose they manage to disconnect the bigraph; then Lew 
and Ralph can reduce further to two non-adjacent edges, one of which is the 
true edge. If that edge represents an overlap of one, the sheriffs will have 
found the killer. 


Theorem 10. [fn = 2pq then there is a deterministic separation protocol 
for solving the two sheriffs problem. 


Proof. We make use of Baranyai’s Theorem [1], which says the following: 

If & divides n then there is an array {Kij}, 1 <i < rn/k, 1 <j < 
(7)/(n/k) of subsets of N = {1,2,...,n} such that each |Kj,;| = k, each 
column Kyj,...,Ky/z,j is a partition of N, and each subset of N of size k 
appears exactly once in the array. 

Such an array (known as a 1-factorization of the complete k-uniform 
hypergraph on n vertices) is fixed by Lew and Ralph (publicly) for k = p, 
and Lew proceeds to tell Ralph on which column his end L of the true edge 
can be found. 

Ralph is thus presented with a partition 1,...,Dn/, of N, one of whose 
parts is Lew’s narrowed-down suspect set. His job will be to split the index 
set I = {1,2,...,n/p = 2q} into two parts, say J, and Jy, so that his suspect 
set R is contained in U,< 1, Li- This will disconnect the bigraph. 

To do this the sheriffs employ a fixed (but arbitrary) map ® from the set 
of subsets of I of size at most q to the set of subsets of J of size exactly q, 
such that ®(S)M S' = for every set S in the domain of ®. Ralph forms the 
set F := {i: ROL; # 0}, then puts F’ := ®(F) and F” := ®(F"). F’ and 
F" are thus complementary subsets of J of cardinality q; let I, be the one 
containing the element “1” of J, and let Iz be the other. Ralph now identifies 
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J, and Ig and announces that his set F’, defined as above, is contained either 
in I or Tp. 

The resulting bigraph will contain all vertices of B(H) for which the 
resulting F’ would have been contained in I; or Ig and would have had 
cardinality q, since in those cases F’ and F” are not dependent on the choice 
of ®. Those vertices for which F’ is contained in J, will form a connected 
component, for e = 1,2. 

Let k be the index of Lew’s end of the true edge, that is, let DL; be Lew’s 
suspect set; suppose k € I,. Let i be such that k is the ith smallest member 
of I, and let 7 be the ith smallest element of J3_.. Lew now announces that 
his set of suspects is in fact either L; or Lx. 

Ralph (but not the mob) will know which of the two is Lew’s suspect 
set: say it is L;. If |RL,| > 1 then Ralph announces that the killer cannot 
be identified; otherwise, however, he now knows the killer (say, 7). Choosing 
(again by order of numbers) the corresponding element y of Ly, he announces 
that the killer is one of x and y. This completes the protocol. 


Let us see how this works in the original case p = q = 2, n = 8. The 
following Baranyai array can be used: 


{1,2} {1,3} {1,4} {1,5} {1,6} {1,7} {1,8} 
{3,4} {2,4} {2,3} {2,6} {2,5} {2,8} {2,7} 
{5,6} {5,7} {5,8} {3,7} {3,8} {3,5} {3,6} 
{7,8} {6,8} {6,7} {4,8} {4,7} {4,6} {4,5} 


Suppose that the true edge is either ({1, 2}, {1,3}) or ({5, 6}, {5, 7}). Then 
Lew will announce that his suspect set belongs to the first column, that is, is 
one of {1,2}, {3,4}, {5,6} or {7,8}. If Ralph himself had one of these sets he 
would simply announce at this point that the killer cannot be identified; as it 
is, he splits the index set, telling Lew that his suspect set is either contained 
in {1,2} U {3,4} or in {5,6} U {7,8}. Lew now says “My set is either {1,2} 
or {5,6}” and Ralph comes back with “The killer is either 1 or 5”. 

Non-deterministic versions of the above protocol are more easily de- 
scribed; Lew merely picks some partition of which his suspect set is a part, 
and Ralph can reduce to two possible suspect sets whose intersections with 
the partition indices are complementary. Here just one more message, from 
Lew to Ralph, completes the protocol. Moreover, this can be made to work 
for any n > 2pq as well. 

However, we can do even better when non-determinism is permitted; for 
example, here is a non-deterministic separation protocol for solving the case 
where n = k?,p = (k — 1)? +1 and g =1, for any k > 2. 

Lew begins by choosing a k x k array {s;,;} of all the suspects, such that 
for some j’, Lew’s suspect set consists precisely of s; 1 and all s;,; such that 
i #7 and j £1. Ralph, who began knowing the identity of the killer, replies 
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as follows: if the killer is s;; for 7 A 1 and any 7, he says “The killer is either 
$1,; or 5;,;.” If the killer is some s; then he picks any 7 # 1 and makes the 
same statement. 

By first partitioning the suspects into possible vertices (as in the p, q, 2pq 
case) and then making a k x k array of the sort described above, but where 
the array elements are members of a partition instead of single suspects, we 
may combine the techniques for the following result: 


Theorem 11. The two sheriffs problem is solvable non-deterministically 
whenever n > q(1+V/p— 1)’. 


It is perhaps interesting to note that we have separated a very dense 
bigraph here, regular on each side. In fact, related to these are the following 
dense bigraphs, which are deterministically separable: fix a large k and set 
HT equal to 


{((a, b), (c,d)) : 1<a,b,c,d<k and either a = c and b = d, or a¥c and b4d}. 


To separate H, Alice’s announces the first coordinate of her pair and Bob the 
second coordinate of his. Then each will know whether their edge is based on 
the equalities or the inequalities in the definition above. 


6. Multi-Party Generalization 


It is evident that many of our definitions and results can be extended to the 
case where there are more than two conversants. In this case conversation 
protocols, in order to remain general, allow the identity of the next speaker 
to depend, at each turn, on the previous conversation. 

In [3] Fischer and Wright suggest using random deals to facilitate secret 
key exchange within a group of persons wishing to communicate privately in 
yet-to-be-specified subgroups. Among the negative results in [3] is a theorem 
(Theorem 9, p. 11) which states that no communication protocol for 3 players, 
each dealt one card of a 3-card deck, can enable them to isolate a secret bit. 
Fischer and Wright indicate that our methods can be used to generalize the 
result to & > 3 players; we show here how that can be done. 

The definition of “bigraph”’ extends easily as follows: a k-graph is a finite 
collection H of k-tuples © = (a1,...,2%) (which we call “blocks” to avoid 
confusion) such that «,y € H implies that x; and y,; are distinct for i 4 j. 
A k-graph is thus a particular special case of k-uniform hypergraph in which 
the sets {x; : x € H} partition the vertex set of H. 

The proof of Theorem 1 goes through, as does an appropriate version of 
Theorem 4; thus we are once more reduced to showing that the players, say 
X, through X;, cannot cooperatively disconnect their k-graph Hy, which in 
this case consists of a block for each permutation of the cards. 


Theorem 12. The “permutation k-graph” H;, is inseparable for k > 3. 
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Proof. Hz is of course separable, indeed disconnected to begin with. Let us 
assume that P is a (non-deterministic) separation protocol for H;, for some 
k > 2, and let H°,H',...,H* be the state of the k-graph for X),...,X, at 
each stage of some (successful) conversation using P. Then H® = Hj, and 
H®,...,H*-} are connected k-graphs. We claim first that Hé must consist 
only of two blocks, which up to permutation of the players and cards, may 
as well be 1,2,3,..., k and 2,3,..., k, 1. To see this let x and y be two blocks 
of H* which lie in different components; then in particular x and y are not 
adjacent so x; £ y; for i = 1,2,...,k. Let G be the graph on vertices 1,2,..., 
k with j adjacent to 7’ when {j, j’} = {x;, yi} for some 7; then since x and y 
are each permutations, G is regular of degree 2. If ¢ is an automorphism of 
G such that $(j) is adjacent to j for all 7 then ($(21),...,¢(ax)) is a block 
of H*® which is adjacent to x if ¢ has any fixed points and to y if ¢ is not the 
identity. Since no block of H* can be adjacent to both x and y, every such ¢ 
must fix all vertices or none; hence G consists of a single cycle. By relabelling 
we may assume x = (1,2,3,...,k) and y = (2,3,...,k,1). 

Now if H* contains any other block some player, say X,, must have 
another vertex, say 7 # k,1. But then the block (1,2,...,7 — 2,7 —1,j + 
1,j+2,...,4—1,&, 3) lies in H*; and it is adjacent to both x (at player X,) 
and y (at player X;,), a contradiction. This proves the claim. 

We may now assume that H* consists exactly of the above blocks x and 
y, and that the last player to speak was X,; then the vertices of H*~! are 
exactly those appearing in x and y, plus some additional vertices held by X, 
which she eliminated in her last message. The fact that H‘'~+ contains those 
additional vertices means that if one of them had been X,’s “true” vertex, 
the conversation might have gone exactly as it did until the last message. 
But H‘~! contains no pair of non-adjacent blocks other than x and y, since 
in every block not equal to x or y, player X2 holds a 2 and player X; holds a 
1. Hence, the protocol P has failed in this case and this contradiction proves 
the theorem. 


7. Final Comments 


In this work we have only begun to study the combinatorial cryptology of 
isolating a common secret. In J. Combin. Theory (B) 84 (2002) pp. 126— 
129, Nicole Portmann has shown that for any n, there are bigraphs that are 
deterministically separable but in no fewer than n steps; and that there are 
bigraphs that are non-deterministically separable but in no fewer than three 
steps. We still do not have a proof that the two sheriffs problem cannot be 
solved deterministically when n < 2pq. 

We hope that our “bigraphs” may prove to be a useful way of representing 
common knowledge, even for applications unrelated to the problem of 
isolating a common secret. Although they carry the same information as 
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do other representations, they may help attract graph-theorists to knowledge 
problems and thus bring some powerful theorems and sharp combinatorial 
minds to bear. 
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Summary. We answer the following question: Let P and Q be graded posets 
having some property and let o be some poset operation. Is it true that Po Q 
has also this property? The considered properties are: being Sperner, a symmetric 
chain order, Peck, LYM, and rank compressed. The studied operations are: direct 
product, direct sum, ordinal sum, ordinal product, rankwise direct product, and 
exponentiation. 


1. Introduction and Overview 


Throughout we will consider finite graded partially ordered sets, i.e. finite 
posets in which every maximal chain has the same length. For such posets P 
there exists a unique function r : P + N (called rank function) and a number 
m (called rank of P), such that r(a) = 0 (r(x) = m) if x is a minimal (resp., 
maximal) element of P, and r(y) = r(x)+1 if y covers x in P (denoted x<y). 
The set Pi) :-= {@ € P: r(x) = +} is called i-th level and its cardinality 
|Po)| the i-th Whitney number. If S is a subset of P, let r(S) := )o,eg7(2), 
in particular r(P) := )0.-¢p r(x). Let us emphasize, that r(P) is here not the 
rank of P. 

A symmetric chain is a chain of the form C = (a <21 <---<a,), where 
r(t9) +r(a;) =m. A subset A of P is called a k-family, if there are no k +1 
elements of A lying on one chain in P. Further, F' C P is called a filter, if 
y > « € F implies y € F, and I C P is said to be an ideal if y< «a EI 
implies y € I. Let d,(P) := max{|A|: A is a k-family} and let w;(P) denote 
the largest sum of k Whitney numbers. Obviously, wz(P) < dg (P), for k > 1. 
The poset P is said to be: 


(i) Sperner (S), if di(P) = wi(P), 
(ii) a symmetric chain order (SC), if P has a partition into symmetric chains, 
(iii) Peck, if d,(P) = wr(P), for k = 1, and |Poo)| = |Pon)| a |Pay| = 
IPom—a)] S00 S|Pam2pl = Permsaryh 


R.L. Graham et al. (eds.), The Mathematics of Paul Erdés I, 39 
DOI 10.1007/978-1-4614-7254-4_4, 
© Springer Science+Business Media New York 2013 


40 Sergei L. Bezrukov and Konrad Engel 


$., B 


Fig. 1 The property S is not preserved by direct product. 
P + Q 
Fig. 2 The properties S, LYM and RC are not preserved by direct sum. 


<1 for every antichain A of P, 


(v) rank compressed (RC), if up := [4 > ") =: yp for every filter F 4 0 


IFT = [P| ~~ 
of P. 


(iv) LYM, if Deca eS 


Since F is a filter if and only if P\F is an ideal, one can define equivalently: 


(v)’ P is rank compressed, if zy := Lf < up for every ideal I £ 0 of P. 


Extensive information on these properties can be found in [5]. Further 
related properties are studied in [7]. 

Already in 1945 Paul Erdos [6] proved that finite Boolean lattices are 
Peck and thus initiated comprehensive investigations on this subject. 

In the following we will study which of these properties are preserved by 
usual poset operations, i.e. the question is: if P and Q have some property, 
is it true that Po Q has this property as well? Here o is some operation. 

Throughout let m (resp., n) be the rank of P (resp., Q). If it is not clear 
from the context whether r is the rank function of P, Q, or Po Q we will 
write rp, TQ, TPoq, respectively. 

A widely studied operation is the direct product P x Q, i.e. the poset on 
the set {(z,y) :  € P and y € Q}, such that (x,y) < (2’,y’) in Px Q if 
& <p wv’ and y <q y’. It is well-known, that the direct product preserves 
the properties SC (de Bruijn et al. [2] and Katona [10]), Peck (Canfield [3]), 
and RC (Engel [4]), and it does not preserve the properties S and LYM (but 
with additional condition it does (Harper [8] and Hsieh and Kleitman [9]), 
see Fig. 1. 

A simple operation is the direct sum P + Q, i.e. the poset on the union 
PUQ, such that « < yin P+ Q if either x,y € Pand az <py,orz,yEQ 
and x <g y. In order to obtain again a graded poset we will suppose here 
m =n. Then it is easy to see, that SC and Peck properties are preserved, 
but S, LYM, and RC not, see Fig. 2. 
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Fig. 3 The properties SC and Peck are not preserved by ordinal sum. 


mo. 


Fig. 4 The properties SC and Peck are not preserved by ordinal product. 


Another easy operation is the ordinal sum P @ Q, i.e. the poset on the 
union PUQ, such that x < yin P@Q, ifz,ye Panda <py,orz,yeEQ 
and x <g y, or z € Pandy € Q. To draw the Hasse diagram of P@Q, put Q 
above P and connect each maximal element of P with each minimal element 
of Q. Then it is obvious, that properties S and LYM are preserved (note that 
any antichain in P @ Q is either contained completely in P or completely in 
Q), and also property RC is preserved (see Theorem 1), but properties SC 
and Peck are not preserved, see Fig. 3. 

An interesting operation is the ordinal product P ® Q, i.e. the poset on 
the set {(z,y) : « € P and y € Q}, such that (2, y) < (2’,y’) in P@Q, if 
x= andy <g y’, oraz <p 2’. To draw the Hasse diagram of P®Q, replace 
each element « of P by a copy Q, of Q, and then connect every maximal 
element of @, with every minimal element of Q, whenever y covers x in P. 
In Theorem 2 we will prove, that properties S, LYM, and RC are preserved. 
Figure 4 shows that properties SC and Peck are not preserved. 

Studying posets like square submatrices of a square matrix, Sali [11] 
introduced the rankwise direct product P x, Q. We will suppose here again 
m =n. Then P x, Q is the subposet of P x Q, induced by Uj" 9 Puy x Qa). 
Sali [11] showed, that properties SC, Peck, and LYM are preserved and gave 
an example that property S is not preserved. Here we present an example, 
which shows that also property RC is not preserved. Look at the poset P of 
Fig.5, which is easily seen to be rank compressed. 

The indicated elements form a filter F’. Now it is easy to see that the 
filter F x, F in P x, P does not verify the filter inequality of (v). 

Finally, we will consider also the exponentiation Q?, i.e. the poset on 
the set of all order-preserving maps f : P > Q (that is, x <p y implies 


42 Sergei L. Bezrukov and Konrad Engel 


P 


Fig. 5 The property RC is not preserved by rankwise direct product. 


f(x) <@ fly)), such that f < g if f(x) <@ g(x) for all « € P. In Theorem 3 
we will prove that none of the five properties is preserved. 


2. Main Results 


Theorem 1. If P and Q are rank compressed, then P © Q is also rank 
compressed. 


Proof. Obviously, if y has rank 7 in Q, then it has rank:+m+1in POQ. 
Hence peg = a er Let I be an ideal in P@Q andI # 0. 
Case 1. Assume IN Q = Q. Since wp < pag, and py < wp as P is rank 
compressed, it follows pz < pea. 

Case 2. Now let INQ #0. Then P C I and I := QM is an ideal in Q. One 
has |J| = |P|+|Z|, r(2) =rp(P)+re(L) + (m+1)|Z]. Therefore, ur < wpag 
is equivalent to 


(IZrq(Q) — Qlra(D) + 1@\1|((m + YIP| — r(P)) + |Pir(Q\F) = 0. 


This inequality is true, since Q is rank compressed and any element of P 
has rank at most m. 


Theorem 2. If P and Q are Sperner or LYM or rank compressed, then 
P®Q is also resp., Sperner or LYM or rank compressed. 


Proof. Let A be an antichain in P®Q. Denote A; = {y €Q: (x,y) € A} 
and A= {ae P: A, #0}. Obviously, A and A, are antichains in P and 
Qz, respectively. 

If P and Q are Sperner, then 


JA] = 32 [Aa] < So wi(Q) = |Alwr(Q) < wi(P)wi(Q) = wi(P @ Q), 
ceA reEA 
hence P ® Q is Sperner. 


Let P and Q be LYM. The level containing (x,y) has |P((x))||Q¢r(y))| 
elements. We have 
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1 7 1 


(P ® Q) ray) Perey) Q(r(y)) | 


(x,y)EA (z,y)EA 


Finally, let P and Q be rank mee Let I be an ideal in P® Q and 

A be the set of maximal elements of J (note that A is an antichain). We use 

ve notation A from above and define further I, := IN Qe, Fe i= Qe\Ie, 

={xe P:1, #0}. Then J, (F,) is an ideal (resp., a filter) in Q, and I 
7 an ideal in P. It is easy to see that: 


Z| = [LIlQ| — So Fel, 


xeEA 


r(I) = |Elre(Q) + (n+ IIQlre(1) — SO (re(Fx) + (n+ D/Felrp(2)), 


recA 
IP 8 Q| = |PIIQ|, 
r(P ® Q) = |Plra(Q) + (n + 1)/Q|rp(P). 


Now oe < eae if and only if 


IPI 0 (\Felre(@) — |@lrq(Fx)) 


ceA 


< (n+ 1/Q| | |Q|(\Z|re(P) - |Plre(D) + > (\Plre(@) — re(P))|Fo| 


reA 


The LHS is not greater than 0 since Q is rank compressed. So it is sufficient 
to verify, that the term in the big parentheses on the RHS is not smaller 
than 0. Denote A’ = {x € A: |P|rp(a)—rp(P) < 0}. Since one can omit the 
positive summands in the formula and in view of |F,,| < |Q..| it is enough to 
show that 


\Zirp(P) — |Plrp(B) + (|Pire(4’) - |A'Ire(P)) 2 0, 
which is equivalent to 
\\A'Irp(P) - |Plrp(Z\a’) > 0 


This inequality is true, since P is rank compressed and [\A’ is an ideal 
in P. 


Let P! be the direct product of | copies of P. The investigation of rank 
compressed posets was initiated by the following result of Alekseev [1]: 


P is rank compressed iff d,(P!) ~ w;(P') as 1 > ov. (1) 
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Fig. 6 The properties 5, SC, Peck, LYM and RC are not preserved by exponenti- 
ation. 


Moreover, from the Local Limit Theorem of Gnedenko one can easily derive 


P l 
wi(P!) ~ sso if P is not an antichain, 
where o?, = Tal > r?(x) — p2 (see [5)). 
ZEeP 


Remark 1. Straight-forward computations give us the following results: 


Be oe 2 
TPxQ —FPTI%|) 


oheq = 7) + (n+ 1), 


2 |P||Q| ( \ iL 


' rsa! 


"P62 ~ (PL + Qn? 


|Ploz + |Q|od). 


Theorem 3. The exponentiation does not preserve any of the properties S, 
SC, Peck, LYM, or RC. 


Proof. First take P and Q from Fig. 6. 

Here we mean that the complete bipartite graphs Ky are formed on the 
indicated vertices. 

Obviously, P and Q have properties S, SC, Peck, LYM, and RC. Now QP? 
is isomorphic to the subposet of @ x Q induced by the set {(x,y) : 2 <q y}. 
Consider the ideal 


I:={(2,y) € Q? :a <i and y <i for some i=1,...,t}. 
Easy calculations give us 


Ot? + 21t 


= ———— and = 4, 
TU 7 aaa 


LI 
But ur > war if and only if t > 8. Hence, Q? is not rank compressed if 
t > 8, and consequently does not have properties SC, Peck, LYM, since these 
properties imply property RC (see [5]). 
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Finally let t > 8 and denote P’ = P+---+P (J times). Again, P’ has 
all of the properties above. It is known (see Stanley [12]), that QP+ +? & 
QP x-+-x QP, hence QP’ = (Q?)!. Since Q? is not rank compressed, by (1) 


a (Q”) = a1 ((Q7)') > wi((@?)') = (Q”) 
if | is sufficiently large. Thus, QP "is not Sperner. 


Concerning the exponentiation, let us mention that if Q is a distributive 
lattice, then so is Q”. Since distributivity implies rank compression (see [5]), 
in a lot of cases the exponentiation provides a rank compressed poset. In 
particular, if Q is a two-element chain, Q? is isomorphic to the lattice of 
ideals of P, which is consequently rank compressed for any poset P. 


3. Summary 


In the following table we have summarized which of the considered properties 
are preserved and which not: 


PéQ 


Sperner 
Symm. chain 
Peck 

LYM 


Rank compr. 
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Summary. The random graph order P,,y is obtained from a random graph Gy,» 
on [n] by treating an edge between vertices 7 and j, with 7 < j in [n], as a relation 
i <j, and taking the transitive closure. This paper forms part of a project to 
investigate the structure of the random graph order Pp,» throughout the range of 
p = p(n). We give bounds on the dimension of P,,» for various ranges. We prove 
that, if ploglogn — oo and € > 0 then, almost surely, 


logn . 4logn 
1+e),/—-— < dimPh,» < (1+ 6),/——_.. 
OF iet7a S S49 sTog(/a) 


We also prove that there are constants ci,c2 such that, if plogn — 0 and p > 
log n/n, then 


cp < dim Phip < cop. 


We give some bounds for various other ranges of p(n), but several questions are left 
open. 


1. Introduction 


The random graph order Py,» is defined as follows. The vertex set is [n] = 
{1,...,n}: we use the symbol ~ to denote the standard linear order on this 
set. We take a random graph G,,, on [n], and interpret an edge between 
vertices i and j with i < j as a relation 7 < j of the order. (For the theory of 
random graphs, founded by Erdés and Rényi [10, 11], see Bollobas [4].) The 
full order < is then defined by taking the transitive closure. Put another 
way, if 7 < j, then we have i < j in P,,, if and only if there is a ~<- 
increasing sequence i = 21,%2,...,4m = Jj of vertices such that each of 
7112, 1913,.-.,%m—1tm is an edge of the underlying random graph. 

In general, the probability p will be a function of n. We say that P,, 
has a property almost surely if, as n + oo, the probability that P, y(n) has 
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the property tends to 1. Throughout the paper, any inequalities we state are 
only claimed to hold for n sufficiently large. Also throughout the paper, we 
set q=1-p. 

Random graph orders have been studied by Barak and Erdés [3], Albert 
and Frieze [1], Alon, Bollobds, Brightwell and Janson [2], Newman and 
Cohen [17], Newman [16] and Simon, Crippa and Collenberg [18]. The first 
three of these papers deal exclusively with the case where the probability p 
is constant; the next two are concerned with the height of the random order, 
and the final one is essentially concerned with the number of incomparable 
pairs of elements. See also the survey by Brightwell [8]. This paper is part of 
a project to investigate the structure of the random graph order more fully 
throughout the range of p = p(n). The authors have produced a paper [5] 
dealing with the width of P,,, throughout the range, and a further paper [6] 
on the general structure of random graph orders. 

This paper is concerned with the dimension of P,». Recall that a realiser 
of a partial order (X,<) is a set of linear orders on X whose intersection 
is exactly (X,<). The dimension of (X,<) is the minimum cardinality of 
a realiser. Alternatively, the dimension is the minimum d such that (X,<) 
can be embedded in R?@ with the co-ordinate order. Trotter’s book [21] is a 
thorough treatment of dimension theory for partial orders. The dimension 
has already been studied to some effect for another model of random orders 
by Erdé’s, Kierstead and Trotter [9]. We discuss there result here, partly 
because we shall use it later, and also because it motivates our approach to 
dimension. 

Define a random bipartite order By,» by taking a random bipartite graph 
on the union of two disjoint sets V and W of vertices, with |V| = |W| = n, 
and joining each pair of vertices (v,w) € V x W with probability p = p(n), 
independently. Then the relations of B,,, are exactly the relations v < w, for 
a vertex uv € V adjacent to a vertex w € W. 

The dimension of B,,, is certainly at most n, since the dimension of any 
partial order is bounded above by its width (see, for instance, Trotter [21]). 
The result of Erdés, Kierstead and Trotter [9] implies that, even if p is fairly 
small, the dimension of B,,,, is almost surely almost as large as n. 
Theorem 1. For every « > 0, there is a 5 > 0 such that, if log'**n/n < 
p<1—n71**, then, almost surely, 

; dpn log pn 
dim Bnip > 1+ dplog pn’ 


In particular, their is a constant c such that, if 1/logn < p<1—n-'/?, 


then, almost surely, 
dim By» > n(1 — c/plogn). 


The proof of Theorem 1 can be found in the original paper [9], or in 
Trotter’s book [21, Chap. 7]. 
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Theorem 1 suggests a method of proving that a partial order has large 
dimension; namely that one should try to find a large “random” bipartite 
order within it. We shall see this idea applied in due course. 

We also note here a powerful upper bound for the dimension, due to 
Fiiredi and Kahn [12] (see also Trotter [21, Chap. 7]). The mazimum degree 
of a partial order (X,<) is the maximum, over all vertices « € X, of the 
number of vertices of X comparable with 2. 


Theorem 2. Let (X,<) be a partial order with maximum degree A. Then 
dim(X, <) < 50A log? A. 


A random bipartite order with p = 2c/logn, where c is the constant of 
Theorem 1, almost surely has dimension at least eAlog A, for some fixed 
€ > 0 where A is the maximum degree. 

Returning now to random graph orders, the only previous result concern- 
ing the dimension is due to Albert and Frieze [1], who showed the following, 


Theorem 3. Let p be a fixed constant with 0 < p <1, and setq=1-p. 
Then, for any « > 0, almost surely, 


logn . 2logn 1 
iba(t/pa) < dim Php < Tog(1/q) + 5 +e. 

Thus if p = q = 1/2, the upper bound is essentially twice the lower 
bound; and the ratio of upper to lower bounds increases as p becomes smaller. 
The upper bound in Theorem 3 is an immediate consequence of the same 
upper bound for the width of P,,, as proved by Barak and Erdos [3] (and 
subsequently sharpened by Bollobas and Brightwell [5] w). The lower bound 
is obtained by showing that a specific partial order of the given dimension 
almost surely exists as a suborder of P,,. Albert and Frieze conjectured 
that this lower bound was correct. One of our principal aims here is to 
improve both the upper and lower bounds in Theorem 3, thus disproving 
that conjecture. We prove the following result. 


(1 —€) 


Theorem 4. Suppose p(n) is a function of n such that ploglogn — oo and 
p<1-1/Vlogn. For any ¢ > 0 we have, almost surely, 
logn ; Alogn 
(1 — €), | —>—~ < dimP,,, < (1+ €),/ ——-.. 
log t/a) e SOF OT Sog( a) 

The gap between the upper and lower bounds for the dimension of P,, 
is thus reduced to a multiplicative factor of (1 + €)2//3, for each constant 
value of p, and indeed whenever p(n) tends to 0 more slowly than 1/ log log n. 
We strongly suspect that the lower bound here is essentially correct, at least 
for constant p, but have been unable to prove it. Section 2 is devoted to a 
proof of Theorem 4: in fact, we prove slightly more general results. 
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Our other main aim is to estimate the dimension for rather lower ranges 
of p = p(n). In [5], we showed that there is a type of “phase transition” as 
p(n) passes through the range p = c/logn; this is investigated further, and 
to some extent explained, in [6]. There is a second, equally radical, change of 
behaviour near p = log n/n, the nature of which will hopefully become clear 
in the course of our proofs. A third transition phase, as p decreases through 
c/n, is readily understood, since the underlying random graph changes from 
being almost all in one component, to being a collection of small trees. The 
following result of Bollobaés and Brightwell [5] gets close to pinning down 
the behaviour of the width of P,,, between the first and last of these three 
transitions. 


Theorem 5. [f plogn — 0 and pn — ov, then the width of Pry» almost 
surely lies between 1.455p—! and 2.428p7'. 


We thus have an upper bound for dim P,,,, of 2.428p~' in this range. It is 
natural to ask whether this is roughly the right order of magnitude: we shall 
prove that it is, provided we are above the second phase transition point. The 
main result of Sect. 3 is as follows. 


Theorem 6. There is an € > 0 such that, if p > logn/n and plogn > 0, 
then, almost surely, 


ep! < dim Ppp < 2.428p~". 


Below p = logn/n, the maximum degree of the random partial order is 
smaller than p~!, so the Fiiredi-Kahn bound, Theorem 2, gives us a better 
upper bound on the dimension. We show that, at least in some range below 
p = logn/n, this upper bound is not too far off being correct. 


Theorem 7. There are positive constants c,,Cc2 such that, if 4 log n/n < 
p < (logn — loglogn)/n, then, almost surely, 


eye?” < dim Php < coe?” log? n. 


In particular, there is a genuine change of behaviour near p = logn/n, 
which is associated with a sudden drop in the value of the maximum degree 
from en to o(n). A later result, Lemma 1, gives some more information on 
the distribution of the number of vertices above a given vertex in P, py. 

The proofs of Theorem 6 and the lower bound in Theorem 7 are practically 
identical, and will be dealt with together in Sect.3. The constant 4/5 
appearing in Theorem 7 can be improved, and indeed it appears that, with 
some difficulty, it can be replaced by any « > 0. The upper bound in 
Theorem 7 remains valid for much smaller values of p, and it would be of 
some interest to determine if a lower bound of the form given in Theorem 7 
holds further down. The following easy result deals with very small values 
of p. 
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Theorem 8. (i) If pn > 0, but pn7/® — oo, then, almost surely, 
dim Py» = 3. 
(ii) If pn®/® — 0, then, almost surely, dim Pup 2: 


Proof. This result is an immediate consequence of well-known structural 
results for random graphs G,,,, with pn — 0 (see, for instance, Bollobas [8}). 

If pn > 0, then all the components of the underlying random graph are 
trees, so the dimension of the random order is at most 3 (see, for instance, 
Trotter [21, Chap. 5, (2.6)]). If pn7/® — oo, then some of the components are 
seven-vertex trees of dimension 3, whereas if pn7/® > 0, all components are 
trees of at most six vertices, so dim P,, , = 2. 


If p = c/n with c < 1, then all components are either trees or unicyclic 
graphs The following conjecture would imply that Theorem 8(i) can be 
extended to this range of p(n). 


Conjecture 1. If P is a partial order whose covering graph is unicyclic, 
then dim P < 3. 


As pointed out to us by Tom Trotter, Theorem (4.4) of Chap. 4 of [21] 
implies that, if P is as above, then dim P < 5. What happens if p = c/n with 
c>1? 

The reader will perhaps have noticed that there is one intermediate range 
of p(n) that we have not yet discussed, namely the range where plogn — oo 
but ploglogn —> 0. In this range, the width of P,p is still almost surely 
about \/2p—!logn and indeed the upper bound for dim P,,,, in Theorem 4 
still holds. Our methods can be adapted to give apparently rather weak lower 
bounds for the dimension, but we have been unable to answer the following 
question. 


Question 1. Is there ane > 0 such that, if p > logn/n, then, almost surely, 
dim Pr,» > ¢ Width(P,,»)? 


Note that we have an affirmative answer if either plogn — O or 
ploglogn — oo. However, we suspect that the answer is “no”. 


2. Large p(n) 


This section is devoted to the proof of Theorem 4. In fact, we shall prove two 
slightly sharper and more general results, which together imply Theorem 4. 
We first consider the lower bound. 


Theorem 9. There is a constant k such that, if 2/loglogn < p < 1—- 
1/Jlogn, then, almost surely, 
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: logn k 
dim Ph» = 1 F 
i a log(1/q) ( sien) 


Proof. We shall use Theorem 1, due to Erdés, Kierstead and Trotter [9]. Set 


_ logn 1 
u log(1/q) (: =i 


and note that m < /lognloglogn. Our aim is to find a “random By,” 
inside our random graph order P,,,y, and to apply Theorem 1. 

We break our vertex set [n] into 1 = |n/2m] consecutive pieces Ay,..., Au, 
so that A;+1 consists of the vertices between 2mj +1 and 2m(j+1) inclusive. 
Each set A; thus consists of 2m vertices; we subdivide it further into two 
consecutive sets 6; and C; of m vertices each, so that B;,, consists of the 
vertices between 2mj + 1 and 2mj +m inclusive, and Cj+4; contains the 
remainder. 

Note that, if there are no edges of the underlying random graph inside 
a set B; or Cj, then that set forms an antichain of size m in P,,. We call 
a set A; good if both B; and C; form antichains. Note that, conditional on 
A; being good, the random order P,,, restricted to A; is distributed as a 
random bipartite order B,,,, (with vertex classes B; and C;). 


The probability that A; is good is equal to q?(2) = q™. Also, the events 
that the various A; are good are independent. Thus the probability P that 


there is no good A, is at most 
(aa). 


Substituting our chosen value for m, and using the inequality | > n/3m, we 
obtain: 


PZ exp (— soa metifan O-1/ lesen)" 
Mm 


< exp 1 nil log log n 
3V/log n log log n 
= o0(1). 


Therefore there is almost surely some good A;. Thus we have, almost surely, 
dim P,» > dim Bmp. 

Note that 1/logm < p< 1—m7"/?, so, by Theorem 1, we almost surely 
have 


dim P,, > dim Bm» >m (1 ) 
plogm 


logn 2 3c 
> 1 1 , 
log(1/q) log logn ploglogn 
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where c is the constant of Theorem 1. The result now follows on choosing an 
appropriate value of k. 


We now turn our attention to the upper bound. The main point here is 
that the dimension is bounded away from the width. Informally, very large 
antichains are rare, and one needs more than one large antichain to have 
large dimension. 

Furthermore, for p above our first transition phase around 1/logn. it is 
shown in [6] that the random graph order has the structure of many small 
orders arranged one above another. Thus we shall make use of the following 
two results. 


Theorem 10. Let A be a maximal antichain in (X,<), with A A X. Set 
U={xeEX:2><a for somea€ A} andD={x EX: a <a for some 
a € A}, so that X is the disjoint union of A,U and D. Let Wy and Wp denote 
the widths of the partial order (X,<) restricted to U and D respectively. Then 


dim(X,<) <1+ Wy + Wp. 


Trotter [19] proves Theorem 10 in the cases where (i) Wp = 0 or (ii) 
Wp = Wu, and in fact the proof of the latter result generalises immediately 
to give Theorem 10. This proof can also be found in Trotter’s book [21, 
Chap. 1 (11.3)). 

A post in a partial order is an element comparable with all others. Posts 
in P,,» are considered in Alon, Bollobas Brightwell and Janson [2], and in 
more detail in Bollobas and Brightwell [6]. Assume for the moment that P,,» 
has at least one post. If 7 and j are posts in P,,, such that there are no posts 
k with i <~ k ~ j, then the partial order restricted to the interval [i, 7) of [n] 
is called a factor of P,y. If i is the first and 7 the last post, then the partial 
order restricted to either of [1,7) or [j,n] is also regarded as a factor. If the 
factors of Pn,» are Fi,..., Fm, with Fy ~ ... < Fm in [n], then Py» is the 
linear sum of F,,..., Fim, i-e., the partial order defined by taking the union 
of the F;, and putting in each relation of the form x < y, where x is an a 
strictly lower factor than y. Note that the dimension of a linear sum is equal 
to the maximum of the dimensions of its constituent factors. 

Set n(p) = []72,(1 — ¢’). We note that there is a constant c such that, 
for p < 1/2, n(p) > cexp(—7?/6log(1/q)). This can be read out of Hall [13, 
equation 4.2.11]. We quote the following result from [6]. 


Theorem 11. Suppose plogn > co. Then there is almost surely no set of 
n(p)~! log? n consecutive vertices of [n] that does not contain a post in Ph». 


We shall apply this plogn — 00, so that (p)~! log? n = o(n*) for every 
€ > 0. The significance of Theorem 11 is then that the partial order P,p is 
almost surely the linear sum of factors, all of which have at most 7(p)~! log? n 
elements. In view of Theorem 10, this implies that, if P,» does have dimension 
at least d, then one of the factors must contain both an antichain A of size 
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d, and a pair of disjoint antichains, also disjoint from A, of sizes summing to 
at least d—1. 

Our plan is to estimate the expected number of occurrences of such 
structures. 


Theorem 12. Let ¢ > 0 be any fixed constant. Suppose plogn — oo. Then, 
almost surely, 


4logn 
3log(1/q) 


Proof. In [5], it is proved that the expected number of antichains of size w 


dim Pr» < (1+ €) 


whose ~<-first vertex is x is at most n(p)~ 22). One can readily adapt the 
argument there to show that, for any triple (x, y, z) of vertices, the expected 
number of triples of antichains of size w,, w2,w3 with ~-first vertices 2, y, z 
respectively is at most 


n(p) Sq?) + C2 )+ C8) : 


The number of triples of vertices (x, y, z) in the same factor of P,,, is almost 
surely at most n(n(p)~! log? n)? from Theorem 11. 

Hence, by Theorem 10, the expected number of factors of dimension at 
least d, conditional on the conclusion of Theorem 11 holding, is at most 


The largest term in the sum is the term with i = |(d — 1)/2], so this 
expectation is at most 


6 gt /2+2(d/2)?/2—2d 6 gt /4-24, 


nn(p)* (log n) = nn(p)~* (log n) 


Setting d = (1+ €)\/4logn/3 log(1/q), for any € > 0, we see that this is at 
most 


ni O4+9"n(p) 8 (log n)"e~ VP" 8" = o(1). 


Since the dimension of P,,p is at most the maximum dimension of its 
factors, this implies that, almost surely, dim P,,, <d, as required. 


As mentioned earlier, we do not expect that the constant 4/3 in 
Theorem 12 is correct. One possible way to decrease it would be to try 
to improve on Theorem 10, under suitable extra hypotheses. Although 
Trotter [20, 21] has shown that Theorem 10 is sharp, even when Wp = Wy, 
the example he constructs looks completely unlike anything that is likely 
to arise as a suborder of P,,,: in particular the size of the antichain A is 
exponential in Wp. On the other hand, one can almost surely find in Py» 
three consecutive sets of vertices of sizes w/2, w and w/2 successively, where 
w = (1—€)\/4log n/3 log(1/q); however the (random) order induced by these 
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sets almost surely has dimension at most w/2—see Erdés, Kierstead and 
Trotter [9]. 

We suspect that the lower bound in Theorem 9 gives the correct constant, 
at least for constant values of p, so we make the following conjecture. 


Conjecture 2. Let p be a fixed constant with 0 < p< 1. For every « > 0, 
we have, almost surely, 


. logn 
dim Php < (1 +¢) log(1/q)’ 


Our principal aim in this section is to prove Theorem 6, giving a lower bound 
of ep~! on the dimension for the major range of p(n). As mentioned earlier, 
this bound is best possible up to the value of «. We shall work in slightly 
greater generality so as to prove Theorem 7 as well. 

We start with an overview of the proof, assuming for simplicity that 
p > logn/n + w(n) loglogn/n, where w(n) — oo. Here and later, we treat 
integer and real quantities interchangeably, for the sake of clarity: the proof 
is not materially affected. 

Throughout this section, we set m = p—!. We restrict attention to the 
set [N] consisting of the <-first N = mlogm + 6m vertices of [n], so we 
are dealing with a random graph order Py,,. We partition [N] into sets 
W <U <U' <W’, with |U| = |U’| = $mlogm, and |W| = |W’| = 3m. Let 
A be the set of maximal elements of the partial order restricted to W, and 
let A’ be the set of minimal elements in the partial order restricted to W’. 
It was noted in our earlier paper [5] that, almost surely, |A],|A’| > 9m/10. 
Our aim is to prove that the random order Py,» restricted to AU A’ almost 
surely has dimension at least em. 

Intuitively, the size of U UU',mlogm, has been fixed so that, for each 
pair (a,a’) € Ax A’, the probability that a < a’ in Py,» is a constant. 
If these relations were independent, then the partial order we consider 
would be a random bipartite order, and, by the result of Erdés, Kierstead 
and Trotter [9], Theorem 1, would almost surely have dimension at least 
(1 — €)9m/10. However of course the relations are far from independent. We 
overcome this, at the expense of finishing with a rather small constant, in the 
following manner. We prove that, almost surely, there are at least ym? pairs 
(a, a’) € Ax A! which are not related in Py,». If we have a realiser Ly,..., La 
of Py», then, for each such unrelated pair, there must be an i € {1,...,d} 
such that a is above a’ in L;—we say L; reverses (a, a’). On the other hand, 
we prove that, for any subset T of A of (small) size t, the number of elements 
of A’ incomparable with all elements of T is at most (1 — 6)'m. This will 


3. Small p(n) 
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imply that the total number of pairs (a, a’) reversed by a linear extension DL 
is at most 2m/d, and so we must have dim Py, > d > ydm/2. 

For X a subset of [N] and B a subset of A, let ['x(B) be the set of 
vertices x € X such that x is above some vertex of B in Py,p, i.e., there is 
a ~<-increasing path in the underlying random graph from some vertex of B 
to x. Our strategy will be to prove that, for every (small) subset B of A, 
Ty(B) is about the “right” size, namely n!/?|B|. Unfortunately, this will fail 
as it stands because a few vertices a in A are likely to be below far too few 
or too many vertices in U. So our first step will be to exclude such vertices, 
and certain others, from consideration. 

Formally, we shall begin by looking just at the set A UU. We shall 
prove that the partial order restricted to this set almost surely has certain 
properties, and by symmetry the partial order restricted to A’ U U’ has 
analogous properties. These properties will of course be independent of edges 
between U and U’. Then we prove that, conditional on these properties, the 
partial order Py,» almost surely has large dimension. 

For the sake of convenience, we begin by working in a slightly altered 
model of random orders. Define the model P’(s,p) by taking two disjoint 
vertex sets R and S = [s], with |R| = 9p~'/10. Then we put in edges 
independently, with probability p, between each pair of vertices, not both 
in R. Thus the model restricted to S is just a copy of P,,. All vertices in R 
are to be thought of as “below” S, so a vertex x € R is below y € S if there 
is a vertex z € S such that xz is an edge of the graph, and z < y in P,,. We 
shall apply this with R = A and S = U. For the greater generality required 
to prove Theorem 7, we need to consider cases where s is a little smaller than 
5m log m where again m = p“!. 

For the next few steps, we work in the model P’(s,p) assuming that 
2mlogm <s< $m log m. As mentioned above, we wish to show that there 
is almost surely a large subset Re of R such that, for every (small) subset T 
of Re, Ra, s(T) is not too far from its expected size. There are essentially 
two steps here, first to ensure that this holds for every single-element subset 
of Rg, and then to extend the argument to larger subsets. 

We start with a lemma which is perhaps of independent interest. Let the 
random variable X(s,p) be the number of vertices of S above a given vertex 
x € Rin P’(s,p). Of course, this, is distributed as the number of vertices 
above vertex 1 in P,,1,). The lemma below gives the exact distribution of 
X(s,p), and also some convenient estimates. Note that our estimates are 
only good in the case where pq~* = o(1), which is when s is rather less than 
mlogm. The identity given in part (i) has been obtained independently by 
Simon, Crippa and Collenberg [18], who study in more detail the mean and 
variance of X(s,p) and the behaviour in the limit as s > oo with p fixed. 

In the lemma below, Geom(a~!) denotes a geometric random variable 
with mean a~, i.e., with Pr(Geom(a~') = t) = a(1—a)* for t a non-negative 
integer. The total variation distance dpy(X,Y) between two real-valued 
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random variables X and Y is the supremum over « € R of |Pr(X < 
x)—Pr(Y < 2)|. 


Lemma 1. (i) For0<t<-s, 

Pr(X(s,p)=t)=q>' |] (-¢’). 

i=s—t+l1 
(ti) Suppose p< 1/2. Then 
drv(X(s,p),Geom(q™*)) < 3pq™*. 
(iii) For every t, 
Pr(X(s,p) >t) < (1—@°)’. 

(iv) If p < 1/2, then, for every t, 

Pr(X(s,p) < t) < exp(2pq7*)(1 — (1 — q°)*). 
Proof. Of course, the formula in (i) can be proved by induction on s, using 
the recurrence 

r(t,s) = r(t,s —1)g°t* +r(t —1,s—1)(1—q°), 


where r(t,s) = Pr(X(s,p) = t): see Simon, Crippa and Collenberg [18] for 
details. We prefer to give a combinatorial proof, which is hopefully slightly 
more informative. 

The probability that the vertices of S = [s] above x are exactly k, + 
Lky tho +2,...,k: +... +h +1 is 


ac - “)) (Ils), 


where ki44 = ee k, —t. As the possible vectors (k1,..., k:41) are exactly 
the ordered partitions of s — t into t+ 1 parts, we have 


: t41 
Pr(X(s,p) =t) = (Ie = °) [xe~*] II I a 


i=l i=l 


By Theorem 349 of Hardy and Wright [14], the coefficient of X*~* in the 
product is 


s—t : 
s—t II 1— er 
i=l ! q' ) 


which implies our formula for Pr(X(s,p) = t). 
For the estimates (ii)—(iv), we have that 
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t-1 
a = Pr(X(s,p) =t) =a *[[Q-@%) (<tK<s) 
j=0 
by = Pr(Geom(q7*) = t) = q*(1—q*)’ (0 <2). 
We claim that: 
(1) a; > 6; for all t at most some to, and a, < b; thereafter; 
(2) If p < 1/2, then a/b: < exp(2pq~*) for all t. 
Claim (1) will imply (iii) immediately, since the right hand side in (iii) is 
the probability that Geom(q7*) is greater than ¢. Similarly Claim (2) implies 
(iv) immediately. For (ii), the two claims together give that 


to 


dyy(X(s,p),Geom(q-*)) = S “(ar — by) 
t=0 
a Ss bi(exp(2pq*) — 1) 
t=0 


< exp(2pq*) — 1. 


This final expression is at most 3pq~* whenever 3pq”* < 1. Since the total 
variation distance is certainly at most 1, we thus have the result as stated. 
For j = l,...,s8, set aj = g +(1 — g®)/(1 — q’). Note that a/by = 
(Ries a ;. Observe that the sequence a; starts greater than 1 and decreases 
to 0 by j = s. This establish Claim (1). (Note that the claim is certainly true 
if t > s, when a, = 0 < };.) 
For Claim (2), we have 


Ss 


log a; = — log(1 — p) + log (1 - 4 (qi - 1) 


<pt+p?—q(q/ -1) 
<pt+p —jpq’, 


and hence 


tox(as/be) < »( 1-49) 103) = FO +2 g(t-1)). — (*) 
j=0 


The expression on the right hand side of (*) is maximised for 2¢ — 1 = 
2(1+ p)q~* when it is equal to 
pq * 
5 


as required to establish Claim (2) and complete the proof of the lemma. 
(Here, we have used the weak bounds p < 1/2 and q* < 1.) 


1+ 2p+p? + q° + pg* +. 97*/4) < 2pq7’, 
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Returning to our main thread, we wish to prove that “many” vertices of 
R have about the expected number, namely about q~*, of vertices of S' above 
them. Lemma 1 will suffice for that, and we shall simply ignore vertices of 
R below too few or too many vertices. We also wish to exclude certain other 
vertices a € R, namely those which only have sufficiently many vertices of 
S above them because of the large number of directed paths through one 
particular neighbour a*: the problem with such vertices is that if another 
vertex b of R is also adjacent to a*, the combined neighbour-set I's ({a, b}) 
may be too small. 

It turns out to be sufficient for our purposes to ensure that vertices of R 
are below about the right number of vertices from the bottom portion of S. 
Define then C to be the set of the bottom m vertices of S, and W to be the 
set consisting of the next w = 6m loglogm, with Y =CUW. 

We begin by choosing a large matching among those edges of the random 
graph between R and C. A result in our earlier paper [5, Theorem 14], implies 
that there is almost surely such a matching of size at least m/3. Given such a 
matching M, let Rag be the set of vertices of R incident with an edge of M. 
For a € Ry, let a* be the vertex of C with aa* € M. For a € Ry, let N(a) 
be the set of vertices of W sending a path down to a avoiding a*, and N(a*) 
be the set of vertices of W sending a path down to a*; i.e., N(a*) =Tw(a*). 
The idea is that, if both N(a) and N(a*) are large, then any subset of R 
including a will receive its due “contribution” from a. 

We shall call a vertex a € Ray good if it satisfies 


(i) 5log’m > |Ty(a)|, 
(ii) Both |N(a)| and |N(a*)| are at least z log® m. 


Lemma 2. There are almost surely at least m/12 good vertices in Ry. 


Proof. Let D be the set of vertices incident with M. Note that the choice of 
D depends only on the edges of the random graph between R and C. For 
x € D, let Z(x) be the number of vertices of W having a directed path to x 
whose penultimate vertex is in W. Thus Z(a) C N(a) for x € D. The random 
variable Z(x) is distributed as X(w,p), so by Lemma 1 the probability that 
Z(x) is at most z log® m is at most 


e2py ~ (1 a (1 = gi log log m log® me) =k e 1/8 +4 o(1). 


For x,y € D, the probabilities that Z(a) and Z(y) are too small are not 
independent, so we cannot immediately deduce that, almost surely, many 
vertices a € R have Z(a) and Z(a*) large enough. However, the problem is 
easily overcome. For any subset V of W, and any x € D, the event that 
{vue W:v>2}=V depends only on the set of edges of the random graph 
upwards from V U {2}, so is independent of the random graph restricted to 
(W\ V)U {y}, for any other element y € D. Thus the random variable Z(y), 
conditioned on the event that Z(x) < 3 log® m, dominates a random variable 
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distributed as X(W’, p) where w' = w— log® m. The probability that such a 
random variable is at most ; log® m is again at most 1—e~'/8+0(1). Thus the 
variance of the number of vertices of D below too few vertices in W is o(m?), 
so the number of such vertices is almost surely at most 2m (1 -—e Vb tec 
m/12. Therefore, almost surely, the number of vertices a € Ryy such that a 
and a* both have sufficient neighbours in W is at least m/3—2m/12 > m/6. 
Finally, the expected number of vertices a € Ray below more than 5 log® m 
vertices of CU W can also be estimated from Lemma 1 to be at most 


1 1 
37 exp(—5 log® mgr Form metas 1) 37 exp( Se! + o(1)) < m/13. 


As before, the variance of this number is 0(m”), so there are almost surely 
at most m/12 vertices in Ryy with too many neighbours. Hence the number 
of good vertices in R is almost surely at least m/6 — m/12, as desired. 


Let Re be a set of m/12 good vertices in Ryy. We have that each vertex 
of Rg is below about log® m vertices of W. We need a little more, namely 
that each smallish set T of good vertices has about |T| log® m neighbours 
in W. This follows from the principle that random graphs do not possess 
small dense subgraphs. 


Lemma 3. Almost surely, for every subset T of Ra with |T| =t < m4, 
IT UTy(L)| > tlog® m/16. 


Proof. For any subset T of the whole set R, define G(T) = TUT y(T) and 
g(T) = |G(T)|. Define a spanning forest F(T) in the graph restricted to G(T) 
by choosing, for each element of Ty (T), one edge down to an element of G(T). 

We first prove that, almost surely, for each subset T of R, either g(T') > 
g(t) = tlog® m/16, or the number of edges induced by the random graph on 
G(T) is at most |F'(T)| + 3t/2. Indeed, the probability that a given set T of 
size t fails is at most the probability that (a) g(Z’) < g(t) and (b) of a given 
set of at most (3) —|F(Z)| pairs of vertices, at least 3¢/2 of them are in 
the random graph. The probability of (b) is at most 

3t/2 


a(t)? sea & (2egtP\" — (thos m\"" _ (log! m 
3t/2 P= \3em = m ~\ m3/4 : 


for t < m1/4, The expected number of failing t-sets is thus at most 


3t/2 


log? 3t/2 
m a = 

oe (=) = ((logm)'%m~"/)! < (logm)~*. 
Hence, almost surely, there are no over-dense sets G(T’). We assume from 
now on that this is the case. 

Now let T be a subset of Rg of size t, with g(T) < tlog® m/10. Set 
T* = {a* : a € T}, and D(T) = TUT™. By the above, G(T) spans at 
most 3¢/2 edges other than those in F(T’). For each such edge, take the lower 
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endpoint, and then follow down the edges of F(T) until a vertex of D(T) is 
reached. Thus there are at least t/2 vertices of D(T) that are not hit. Now 
consider the collection of sets N(a), for a a vertex of D(T) that is not hit. 
Each such set has size at least log® m/8, by our choice of Rg. We claim that 
these sets are mutually disjoint. If not, there is a minimal vertex w in some 
pair of them, so w sends edges down to both sets, and one of these edges is 
not in F(T): the path thus generated hits the vertex a defining one of the 
two sets, leading to a contradiction. Thus the union of these sets has size at 
least tlog® m/16, and this union is certainly contained in G(T), which is a 
contradiction. 


Once we have all our small sets “spreading” into the bottom 6m log log m+ 
m elements of S$, it is not hard to show that they continue to spread at a 
very steady rate through S. 


Lemma 4. Almost surely, for every subset T of Ra with |T| =t < m'/4, 
2tes/™ > |T UL s(T)| > te?/™/80. 


Proof. We break S \ Y into consecutive sets of size m/logm, say Agiyi < 
Apia <...< Aj, where! = slogm/m< tL log* m and k = 6 logm log log m+ 
logm. For j7 = k,...,l, set Cj; = YU esas A;. For a set T of size t, and 
k <j <1, we estimate the probability that 


[Poss (DP) — Te, (T)| + 1/logm)| > |Po, (T)| log*/? m, 


given that m°/4 > y = [['o,(T)| > tlog® m/16. 
The number N;(T) of vertices of Aj: sending an edge to To, (T) is a 
binomial random variable with parameters m/logm and 


(p= 7 (1 + O(m-4))., 


Hence we have 
Pr (i (T) 


where we used the Chernoff bounds on the tail of the Binomial distribution, 
the lower bound on y, and implicitly the fact that the variation we tolerate 
is much greater than the error in our estimate of y/logm for the mean of 
N,(T). 

Hence, for each fixed t, the probability that, for some T of size t, and 
some j, the set [o,(T) fails to spread within the prescribed bounds is at 
most 


Te > Y < Qe77/3log* m < Qe7tlog? m/48 
logm| ~ log®/?m/ ~ 7 


Imt lets” m/48 _ o(m—"/4), 


Thus, almost surely, every set [¢,(T’) spreads at the required rate. 


Hence, almost surely, for every set T of size at most m1/4, 
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1 1 i Ve 
P's(T)| > —tlog® m (14 
Ins(P)1> qetlon’ m (14+ = - Ca) 
t 1 1 
> 6 (stogtogm + (ee ~ 6 log m og log m — log (Gon = a) 


> ex Zt 7 
~ 16 ” m mlogm 


> £ os/m-3/2 > St os/m 
16 80 


Similarly, almost surely, for every set T, 


IT s(T)| < 5tlog® m(1 + 1/logm + 10°/log®/? m)!-* 


] 1 
< 5texp (toyioxm + (e" — 6 log m log log m — log) cen) 
m logm 


= §te?/™-1 < 2te8/™, 


This completes the proof. 


There is one more property we want from our model P’(s, p), namely that 
no single element of S is above too many members of R. A reasonably sharp 
bound follows readily from Lemma 1 (iii): the number J(x) of elements of 
RUS below any member z of S is dominated by a random variable distributed 
as X(s+1r,p), so the probability that J(x) is greater than 3e*/” log m is at 
most 


(1 7 getrype™ logm < (1 = (1 _ 1/m)stmy3e"™ log m 


—3e3/™ log m —11/10 —1 
< = = 


so there is almost surely no vertex x € S above more than 3e*/” log m element 
of R. 
Collecting together our results so far, we have the following. 


Lemma 5. Suppose p > 0, and s < $mlogm. Then, almost surely, the 


random order P'(s,p) has the following properties. 


(1) There is a subset Ra of R of size at least m/12 such that, for every subset 
T of Rg of sizet << m4, 


2tes/™ > |T UT s(T)| > te®/™/80. 
(2) No element of S is above more than 3e°/™ logm elements of R. 


Our strategy for the next few steps is as follows. We take four sets 
A, A’, 5, S’ with |A| = |A’| = m/12 and |S| = |S’| = s. (One should think of 
R<S<S' < R’. with a P’'(s,p) random partial order on RUS and another 
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independent copy on the dual of R’ US’, then A = Rg and A’ = RG.) We 
then fix any partial orders < and <’ on AUS and A’ U S” respectively, 
satisfying the conclusions of Lemma 5. To be precise, we state the following 
three properties. 


(P1) Every element of A is minimal in <, and every element of A’ is maximal 
in <’. 
(P2) For every subset T of A of size t < m4, 
2te/™ > |T UT s(T)| > te*/™/80; 
and for every subset T of A’ of size t < m1/4, 


Hee S(T Ur At) Ste) 80, 


where I“4(Z’) is the set of elements of 5’ below some element of T in 
a 

(P3) No element of S is above more than 3e*/” logm elements of A in <, 
and no element of S$’ is below more than 3e*/"" log m elements of A’ in 
a, 


Given orders <, <', <g with: (i) < and <’ orders on SUA and $’U A’ 
respectively, satisfying (P1)—(P3), (ii) <g a bipartite order, in which x <p y 
implies that « € AUS and y € A’US’, we define the order Q(<, <’,<g) to be 
the transitive closure of the union of the three orders. Thus, for a € A,a’ € 
A’,a is below a’ in Q(<,<’,<g) if and only if x <g y for some x € ['g(a) 
and y € IG, (a’). 

Given < and <’ as a bove, we define a random partial order P” = P” 
(<,<',p) on AUS US’ UA’ by taking a random bipartite order <p=<p 
(s,p), with edge probability p, and vertex sets AU S,S’U A’, and forming 
Q(<, <', <p). 

(Note that, if < and the dual of <’ were chosen as random P’(s, p) orders, 
with S immediately below S$’ in <, and then A and A’ selected as sets of good 
vertices, then the random partial order just defined is distributed as the 
restriction of the random graph order to the chosen vertex set, conditional 
on < and <’.) 

Our aim is to prove that, almost surely, the random order P” has large 
dimension. 

We first show that, almost surely, many pairs (a, a’) € Ax A’ are unrelated 
in P”. For this, we may as well assume that s = $mlogm. 

A pair (a,a’) € A x A’ is related exactly when there is some relation of 
<, between an element of I'g(a) and an element of I4,(a’). These sets both 
have sizes at most 2e%/" = 2m/1/?, by property (P2), so the probability that 
the pair is unrelated is at least q*” = e~4(1 + 0(1)). Therefore the expected 
number of unrelated pairs is at least (72)” e~4(1 + o(1)) > m?/8,000. 

Again, we have to confront the problem that the relations between these 
pairs are far from independent. We deal with this by using the following 


64 Béla Bollobaés and Graham Brightwell 


“isoperimetric inequality” of Bollobas and Leader [7] (see also Leader [15]). 
Let Q;(p) denote the “weighted” k-dimensional cube, i.e., the set of subsets 
of {1,...,k} with a set V given weight p!4!q*—!4!. This weight is a probability 
measure, so Q;(p) is thus a probability space. Our intention here is to identify 
{1,...,&} with the elements of (S U A) x (S’U A’), when the probability 
measure Pr, on Q;(p) coincides with our probability measure on bipartite 
orders <g. For A a subset of Q;(p), and 1 € N, we define 


Avy = {B € Q:(p) :|BAA| <1 for some A € A}. 


Finally, for r € N, we define B“”) to be the set of subsets of {1,...,k} with 
at most r elements. The inequality of Bollobds and Leader [7] is as follows. 


Lemma 6. (i) Let A C Q;(p) be a down-set with Prp(A) > Pr,(B"”). 
Then, for every EN, we have Prp(Aqy) = Pi (Br), 
(ii) For z > 0, let AC Ox(p) be either a down-set or an up-set, such that 
Prp(A) > e-*. Then Pr,(Aqy) > 1—e-*, where | = /12zpk. 


Part (ii) of Lemma 6 follows immediately from part (i) on applying the 
Chernoff bounds for the binomial distribution. 

As indicated, we apply Lemma 6 with z = logm,k = (s + m/12)? < 
=m? log? m, and the set {1,...,k} identified with the pairs (x,y) € (SU 
A) x (S’U A’). Let A be the set of those bipartite orders <g giving rise to a 
partial order Q(<, <’,<g) in which at least m?/10,000 pairs (a, a’) € Ax A’ 
are unrelated. Note that A is a down-set. Since the number of unrelated 
pairs has expectation at least m?/8,000 and is bounded above by m?, the set 
A has probability at least 1/4,000 > 1/m = e~*. Therefore, by Lemma 6, 
Prp(Aqy) is at least 1 —o(1), i-e., a random order <g is almost surely within 
1 < 2m1/? log®/? m edges of a bipartite order in A. 

By property (P3) of the orders < and <’, the addition or removal of 
any edge (x,y) between S UA and S$’ U A’ can only change the number 
of unrelated A— A’ pairs by at most (3e%/"logm)? < 9mlog?m. Thus 
the number of unrelated pairs in P” is almost surely at least m?/10,000 — 
(2m1/2 log?/? m)(9m log? m) > m?/20,000. We say that a partial order P” 
with at least m?/20,000 unrelated A — A’ pairs satisfies property (Q1). 

To complete our project, we now have to show that, almost surely, every 
small subset of A has fairly many elements of A’ above it, and vice versa. 
The proof of this is very similar to the previous part. 

Consider any subset T of A of size t < m'/4, and any element x of 
A’. By property (P2), |T UI's(T)| > te’/™/80, and |I%G,(x)| > e%/"/80. 
Thus the probability that x is above some element of T in P” is at least 
1—(1—1/m)te""/"/6.400 > 1 —exp(—te?s/™ /6,500m). Hence the number N(T) 
of elements of A’ above some element of T has mean at least (m/12)(1— 2°), 
where x = exp(—e?*/™ /6,500m). Note that 1 — a > e?8/™/7,000m. 

We again use the result of Bollobés and Leader [7], Lemma 6, to show 
that N(T) is almost surely never far from its mean. This time, we identify 
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{1,...,k} with the set of pairs (a, y) € (TUI's(T)) x (A’US"), so k < 3stes/™: 
clearly N(T) depends only on the relations of <g in this set. 

Let A = A(T) be the set of bipartite orders <g such that N(T) > 
(m/12)(1 — x’) — 1. Note that A is an up-set. Also, EN(T) < (m/12)(1 — 
z') —1+mPr,(A), so Prp(A) > 1/m. 

We apply Lemma 6 with z = tlogm, and deduce that 


=1 
Pr(Aqy) 2 1- ete" = 1 - o(") (("’) mils), 


Therefore, with at least this probability, the addition of at most | = /12zpk 
relations to <g will ensure that N(T) > (m/12)(1 — 2) — 1. Note that 


1 = \/12zpk < V/36tlog mm-'!stes/™ < 6te*/?” log m. 


By property (P3) of <, the addition of any one relation only increases N(T) 
by at most 3e%/™ log m. 
Therefore, almost surely, every set T of size t < m!/* has 

mm 

N(T) > — 

(T) 2 12 

Hence, for each T’,, the number of elements of A’ not above some element of 
T is almost surely at most 


(1 — a*) — 1 — 18te39/?"" log? m. 


5" + 20te?5/2™ log? m. 


We say that a partial order P” satisfying this condition has property (Q2). 
Of course, the analogous property (Q2’) also holds almost surely for all small 
subsets T” of A’. 

Let (Q3) be the property that, for every pair of sets T Cc A, T’ Cc A’, of 
sizes 

to = 10°mlog me~?9/™, 

we have x below y in P” for some x € T,y € T’. We prove that P” almost 
surely has property (Q3). Recall that s > 2mlogm, so that to < m'/4 and 
therefore, by property (P2), |[T UI's(T)| and |T’ U's, (T")| are both at least 
tge*/™ /80. The probability that a particular pair (T, 7”) fails is at most 


—2 
(1 1/m)tae"/"/6,400 < ¢  l0talogm _ 4 ((”) 
0 


Thus the expected number of failing pairs is o(1). 

Now we assume that P” does have properties (Q1), (Q2), (Q2’) and 
(Q3). Thus there is a relation between every pair of subsets of size tg = 
10°m log me~?8/™, every subset T of A of size t < m‘/* has N(T) > 
(m/12)(1 — 2*) — 20te?*/?” log? m, and the analogous inequality holds for 
subsets of A’, and there are at least m?/20,000 unrelated pairs in A x A’. 


66 Béla Bollobaés and Graham Brightwell 


Recall here that « < 1— e28/™ 17, 000m. We now show that these assumptions 
imply that the dimension of P” is large. 

We bound the number of pairs (a,a’) € A x A’ reversed by a linear 
extension L of P”. Consider the top tp elements of A in L, say a, > ag >p 
‘++ >7 ), and similarly the bottom to elements of A’, say a <p +++ <r aj,- 
By property (Q3), there is some pair 7, j(1 < i,j < to) such that a; < aj in 
P", so every pair reversed by L involves one of these 2t) elements. 

For 1 < 7 < to, the number of reversed pairs involving a; is at most the 
number N({a1,...,a;}) of elements of A’ incomparable with all of ai,...,a;, 
and so by (Q2) is at most aI + 20je8/?™ log” m. Hence the total number 
of reversed pairs is at most 


t 
2 S- i (2 D0 je8?! 2" loo" m) 


j=1 \12 
m ma are ; 2 38/2m 7.2 
< ae 1— pom ) + 20toe log? m 
2 
- ee +2 x 10'n2¢e—58/2™ Jop* m 
e 
< 1900m7e 7", 


Therefore, under our assumptions, the dimension of P” is at least 


m2 /1,200m? 
20,000/ — e28/m 


As we have been indicating throughout, this suffices to prove Theorem 6, 
and indeed also Theorem 7. We fill in the details below. 


= 1078e28/™, 


Proof of Theorem 6. The upper bound follows immediately from the upper 
bound on the width given in Theorem 5. 

For the lower bound, we set m = p~~ as usual, and take s = $m log m. 
We are given p > logn/n, so n > 2s + 6m. We define four sets as follows: B 
consists of the <-first 3m elements of [n], U consists of the <-next s elements, 
U' of the <-next s elements, and B’ of the <-next 3m. There are almost surely 
at least 9m/10 maximal elements in the random order P,,, restricted to B, 
and 9m/10 minimals in the order restricted to B’, as noted in [5]. If this is 
the case, then, to be definite, take A to be the set of the 9m/10 ~<-largest 
maximals from B, and A’ to consist of the 9m/10 ~<-smallest minimals from 
B’. Note that the choice of A and A’ depends only on the order restricted to 
B and to B’. 

We next consider the random order restricted to the sets A UU and 
A’ UU’. These are distributed as a random order P’(s,p) and its dual, so 
the conclusions of Lemma 5 hold almost surely for R = A and S = U, and 
dually for R = A’ and S = U’. In other words, the restrictions < and <’ of 


1 
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the random order P,,, to ANU and to A’NU’ almost surely have properties 
(P1)-(P3). 

Conditioned on the restrictions of P,» being equal to any particular 
pair <, <’ satisfying (P1)-(P3), we have seen that the dimension of the 
random order restricted to AU A’, and hence dim P,,,,, is almost surely at 
least 10~8e?8/™ = 10-8m. 

Combining all the above, we see that the dimension of P,» is almost 
surely at least 10~°p~!, as claimed. 


Proof of Theorem 7. Set m = p~' and s = 4(n — 3m). Note that s > 
2 
2m log m. The proof of Theorem 6 goes through without alteration, and we 


deduce that 
dim Py 10a > -e™, 


almost surely, as required. 

For the upper bound, we apply Theorem 2, the upper bound of Fiiredi 
and Kahn [12]. Thus we need an estimate on the maximum degree A of an 
element of Pn». A bound on A follows from Lemma 1 (iii), just as in the proof 
of Lemma 5(2), namely, almost surely, A < 3e?" logn. Theorem 2 now tells 
us that 


dim Php < 50A log? A < 150e?" log n(pn + log log n + 2)” < 40e?” log? n, 


as required. 


We have proved Theorem 6 with 1078 for the value of ¢; clearly this can be 
improved substantially, but tinkering with the method is unlikely to produce 
a reasonable constant. We suspect the result is true with e = 1, at least, and 
maybe it is even true that the dimension is almost surely (1—0(1)) times the 
width. 

On a slightly more abstract note, it is likely that there is a constant c 
such that, if pn/logn > co, plogn + 0 and € > 0, then, almost surely, 


(c-—e)p t< dim Prip < (e+ e)p', 


but this may be rather hard to prove. 
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Summary. Given a hereditary graph property P let P” be the set of those graphs 


in P on the vertex set {1,...,n}. Define the constant cn by |P”| = gen(3), 
We show that the limit limn..cn always exists and equals 1 — 1/r, where 
r is a positive integer which can be described explicitly in terms of P. This 
result, obtained independently by Alekseev, extends considerably one of Erdés, 
Frankl and Rédl concerning principal monotone properties and one of Prémel 
and Steger concerning principal hereditary properties. 
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1. Introduction 


A property P of graphs is an infinite class of graphs which is closed under 
isomorphism. A property P is hereditary if every induced subgraph of every 
member of P is also in P, and is monotone if every subgraph of every member 
of P is also in P; a monotone property is therefore also hereditary. Let P” 
be the set of graphs in P with vertex set [n] = {1,...,n}. In this paper we 
are interested in the rate of growth of P” with n, so it is convenient to define 
the constant ¢, = ¢n(P) by |P”| = 2¢n(2), Note that eventually c, < 1 unless 
P is the trivial property consisting of all graphs. 

Scheinermann and Zito [26] asked if, for a hereditary property, limy+oo Cn 
always exists, and if so what the possible values are. The limit had been 
evaluated earlier for the monotone property of K,,-free graphs by Erdés, 
Kleitman and Rothschild [11], by using the method of Kleitman and 
Rothschild [13]. Their result was generalized by Erdés, Frankl and Rédl [10], 
who considered the monotone property of graphs not containing a given 
graph F' as a subgraph. The structure of K,,-free graphs was investigated 
by Kolaitis, Promel and Rothschild [14]. 
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It is considerably more difficult to determine the asymptotic size of the 
hereditary, but non-monotone, property P of not containing a given graph F’ 
as an induced subgraph; this problem has been studied by Prémel and Steger 
in a series of papers {17,18,19,20,21]. In particular in [18] they gave sharp 
estimates for the property of not containing an induced quadrilateral, and in 
[19] they evaluated lim,,_.,. Cn for every given graph F’.. 

Our purpose in this paper is to evaluate lim, _... Cn for every hereditary 
property P, not only for principal properties, namely those defined by a single 
forbidden subgraph (induced or otherwise). Our main result (Theorem 4) 
claims that this limit equals 1 — 1/r, where r = r(P) is an integer which we 
call the colouring number of P, to be defined below. Since writing this paper 
we have discovered that we were anticipated in the result by Alekseev [2], 
but the present proof appears to be simpler and more natural. In particular, 
it is reasonable to suspect a relationship between the main theorem and the 
Erdés-Stone theorem. In our proof this relationship is established and is made 
transparent. 

The proof of Theorem 4 contains an implicit demonstration of the exis- 
tence of limn—+oo En. In fact, Alekseev [1] already showed that the limit exists. 
Moreover it was shown in [7] that the sequence (c,,) is monotone decreasing 
(and so the limit must therefore exist). The analogous sequence monotonicity 
property was shown to pertain to uniform hypergraph properties, and so the 
corresponding limits again exist. However, for hypergraphs we are unable to 
say anything about the value of the limits, or even whether every real number 
between zero and one is the limit for some property. 

The colouring number r(P) of a property P is defined as follows. Let 0 < 
s <r be integers. An (r,s)-colouring of a graph H is a map =: V(#H) => [r] 
such that H[y~1+(i)] is complete for 1 <i < s and is empty otherwise. Note 
that H is (r,0)-colourable if and only if y(H) <r, since an (r, 0)-colouring 
is just an r-colouring in the usual sense. Note too that a graph is (r,r)- 
colourable if and only if y(H) <r (all notation used but not defined in this 
paper is described in [3]). Now let 


Cy (r,s) ={H : |H| =k and F is (r, s)-colourable}. 
Then the colouring number r(P) of a hereditary property P is defined by 


r(P) =max{r : there exists 0 < s <r such that P D U Cx (r, s)}, 
k>1 


that is, r(P) is the largest integer r such that, for some s, P contains every 
(r, s)-colourable graph. Since a graph of order r is (r, s)-colourable for every 
8,0 <8 <r, it follows that r(P) is finite if P is non-trivial. Note also that 
the only (1,0)-colourable graphs are empty, and the only (1, 1)-colourable 
graphs are complete, so by Ramsey’s theorem r(P) > 1. 

Consideration of the property complementary to P gives us another way 
to view the colouring number of P. A property P is hereditary if, and only if, 
for some sequence F}, Fo,... of graphs, P is the collection of graphs having 
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no induced subgraph isomorphic to an F;. Then 
r(P) =max{r: for some 0 <s <r no F; is (r, s)-colourable}. 


Although monotone properties P are also hereditary, and so the above 
serves to define r(P), it is worth giving the definition in a third and simpler 
form for these properties. A property P is monotone if, and only if, for 
some sequence F), f2,... of graphs, P is the collection of graphs having 
no subgraph isomorphic to an F;. Since any (r,s)-colourable graph contains 
an (r,0)-colourable subgraph, we see that for a monotone property P the 
colouring number is 


r(P) = max{r: no F; is r-colourable}. 


Therefore for monotone properties our result is that limp. ¢, = 1—1/r, 
where r = min{y(F;)} — 1. This is in contrast to the case of hereditary 
properties in general, for which the colouring number is not merely the 
minimum of those colouring numbers of the properties defined by excluding 
a single F;. 

It is immediate that liminf,..¢, > 1-—1/r(P) for any hereditary 
property P. For let s be an integer for which P contains every (r, s)- 
colourable graph. Partition the set [n] into r disjoint classes V,..., V,, where 
|n/r| < |Vi| < [n/r]. Every graph with vertex set [n] in which the subgraph 
spanned by V; is complete for 1 <7 < s, and in which the subgraph spanned 
by V; is empty for s < i <r, is (r,s)-colourable and is therefore in P”. Hence 
|P"| > g-1/r+O(d/n))(3) which shows that lim inf; .o. ¢; > 1—1/r. 

Let us denote by exina(n, P) the maximal number of edges in a graph Go 
of order n, for which there is an edge-disjoint graph G1 on the same vertex set, 
such that every graph G with G; C G C GoUG? is in the hereditary property 
P. This invariant was introduced (for principal properties) by Prémel and 
Steger [19]. Trivially |P"| > 2a?) and it is clear from the construction 
in the paragraph above that exjna(n, P) > (1— Py +0(1))(3); it is, therefore, 
a consequence of our result that this last inequality is, in fact, an equality. 

The proof of Theorem 4 is based on three well-known results, namely those 
of Ramsey [22], of Szemerédi [27] and of Erdés and Stone [12]. Ramsey’s 
theorem states that for each positive integer k there exists an integer R(k), 
such that if n > R(k) and the edges of the complete graph K,, are coloured 
with two colours then there will be a monochromatic complete subgraph of 
order k. Szemerédi’s lemma will be stated and discussed in Sect. 3. The Erdés- 
Stone theorem will be discussed in Sect. 2; in fact, for the present purpose we 
have to prove a slight extension of that theorem. 

Each of the three cited fundamental results asserts the existence of certain 
constants. For the achievement of our aim, which is to describe limy_556 Cn, it 
is sufficient that these constants exist. To investigate the rate of convergence 
of the sequence (c,,) we would need effective versions of these theorems, but 
we make no attempt whatsoever to carry out this investigation. 
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2. An Extension of the Erd6és-Stone Theorem 


Let r,t > 1 and ¢ > 0. The well-known theorem of Erdés and Stone [12] 
states that every graph of order n and size at least (1 — 1/r + €)($) contains 
K,+1(t), the complete (r + 1)-partite graph with t vertices in each class, 
provided n is large enough. Exactly how large n needs to be, as a function of 
r, t and €, was investigated by Bollobas and Erdés [5], Bollobés, Erdés and 
Simonovits [6], and Chvatal and Szemerédi [9]. 

We shall prove here an extension of the Erdés-Stone theorem, in which a 
certain number of ‘forbidden’ edges are added to the graph, and it is required 
that the K,4+1(¢) span no forbidden edge. To be precise, let G and F' be 
two graphs on the same vertex set. We say that a subgraph H of G is F- 
avoiding if V(H) spans no edge of F’. Our theorem shows that if G satisfies 
the conditions of the Erdés-Stone theorem, and if e(F’) is sufficiently small, 
then G will contain an F-avoiding K,4+1(t). 

An alternative proof of the theorem has been pointed out by Rédl [23]; 
his proof depends on a ‘supersaturated’ version of the Erdés-Stone theorem, 
stating that every graph of order n (sufficiently large) and size at least (1 — 
1/r + )(3) contains not just one but en'"+)* copies of K,+1(t). Given this 
theorem, our theorem follows at once since if e(F’) is small it cannot meet 
all of the copies of K,.41(t). However, our purpose here is to give a short 
self-contained proof. For the proof it will be convenient to have the following 
weak form of Turdn’s theorem [29], wherein G denotes the complement of G. 


Lemma 1. I[f G is a graph of order m > t? and e(G) < m?/2t thenG D Kz. 


Proof. By Turan’s theorem, if G Z K; then 


(@ > = (a ! aut 0) : a _ _m(m—t+1) . m 


2 2(t — 1) ae 


as claimed. 


Theorem 1. Given r > 0, t > 1 and e > 0, there exist 6 = d(r,t,€) and 
no = no(r,t,€) such that the following holds. Let F and G be graphs on the 
same vertex set of order n > no, with e(F) < dn? and, ifr >1, with 


1 n 
ee 
e(G) > ( a €) (5) 
Then G contains an F'-avoiding K,41(t) subgraph. 


Proof. Note first that we may assume that F and G share no edges. We 
shall apply induction on r. If r = 0 then Lemma 1 applied to G shows that 
6 = 1/(2t) and no = #? will do. 

Suppose then that r > 1 and that the assertion holds for smaller values of 
r. Let us first make the customary observation that for some m, (€/2)!/2n < 
m <n, G has an induced subgraph G,,,, of order m and minimal degree at least 
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(1—1/r+e/2)m. Indeed, if this were not the case then, with s = |(e/2)!/?n], 
we could find subgraphs G,, = G D> Gyn_1 D... D Gs such that, forn >7> s, 
G; is a subgraph induced by 7 vertices and the only vertex of Giji1 not in G; 
has degree less than (1 — 1/r + €/2)(i+1) in Gigi. But then 


e(Gs) > a-2+a(3) -a-24+ 9 6+y > en? /4> 87/2 


if n is large, which is a contradiction. 

Let F,, be the subgraph of F induced by V(G,,). Note that e(Fin) < 
dn? < (26/e)m?. Let T = [4t/er]. By the induction hypothesis, if n is large 
enough and 6 is small enough, G,, contains an F-avoiding K,(T), say K. 
The bound on the minimal degree of G,, implies that each vertex of K sends 
at least (1 —1/r+e¢/2)m—rT edges to G, — K. We claim that the set U of 
vertices of G, — K sending at least (r — 1+ er/4)T edges to K has at least 
erm/5 members. Indeed, if this were not the case then the number of edges 
f between K and G,, — K would satisfy 


1 
rr + (1 +) m(r 1+ 2)T>f2 f22uF m—rT >rT, 
5 5 4 r 2 


which is false if n is large. 
Each vertex of U is joined to a K,(t) subgraph of K. There are only (7) 
such subgraphs, so for some K,,.(t) subgraph K’ the set W of vertices of U 


joined to every vertex of K’ has at least erm (*) ~" members. Now W spans 
at most (26/¢)m? edges of F, so again by Lemma 1 if n, and hence m, is 
large enough and 6 is small enough, the set W contains ¢t vertices spanning 
an independent set in F’', and hence forming with A’ an F-avoiding K,+,(t) 
subgraph of G. 


3. Universal Graphs 


Given a finite class G of graphs, we say that a graph G is G-universal if every 
member of G is an induced subgraph of G. The classes of graphs of interest 
to us are the classes C;(r, s) of (r,s)-colourable graphs defined earlier. Our 
aim in this section is to show that every suitably large collection of graphs 
has a C;(r, s)-universal member, for some s. 

For the proof of our theorem we will need a lemma of Szemerédi [27]. In 
order to state this lemma the notion of uniformity must be defined: a pair 
of subsets U and W of the vertex set of a graph G is said to be n-uniform 
if |d(U,W) — d(U',W’)| < » whenever U’ C U, |U'| > n\U| and W’ C 
W, |W’| > 7|W|, where d(U,W) = e(U,W)/|U||W|. Szemerédi’s Uniformity 
Lemma is (equivalent to) the statement that, given 7 > 0 and an integer J, 
there is an integer L = L(1,7) such that the vertices of every graph of order 
n can be partitioned into m classes Vi,...,Vm, for some 1 < m < L, so that 
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[n/m] < |Vi| < [n/m] and all but at most ("}) of the pairs (V;,V;) are 
yn-uniform, 1 <i<j<m. 

The following lemma is a standard application of the notion of uniformity 
(see for example [24]); once again we shall give a self-contained proof in a 
form convenient for our needs. 


Lemma 2. Let H be a graph with vertex set {11,...,r@%}. LetO<A,n <1 
satisfy kn < \*—!. Let G be a graph with vertex set UE4 V, where the V; are 
disjoint sets each of order u > 1. Suppose that each pair (V;,V;), 1<i<j< 
k, is n-uniform, that d(Vi,V;) <1—A if aia; ¢ E(H) and that d(V;,V;) > » 
if xx; € E(H). Then there exist vertices v; € Vi, 1 <i<k, such that the 
map XL; > v4 gives an isomorphism between H and the subgraph of G spanned 
by {v1,..., UK}. 


Proof. Observe that by replacing, if necessary, the set of V;-V; edges of G by 
the complementary set of V;-V; edges, we may assume that H is the complete 
graph and that d(V;,V;) > A for all 1 <i <j < k. We shall select, one by 


one, the vertices v1,...,v,% so that after v1,...,v; have been chosen there are 
sets Us CV;,1< 3 <k, such that each of v1,...,v is joined to every vertex 
of U} and |Uj| > (A—7)'u. Clearly we can begin (with 1 = 0) by taking 
U9 = Vj. 

In order to find wj41, having found v1,...,v, let 


Wj = {v € Ube: [P@) NU] < A m)lUs 


forl+1<j<k. Since d(W;, Ul) < A— yn, the definition of n-uniformity 
applied to the pairs (W;,U}) implies that either |W,| < nu or |Uj| < nu. 
However the latter is ruled out because 1 < k — 1 and hence |U}| > (A — 
n)®-1u > (A*-1 — (k — 1)n)u > nu. Hence 


isa \ UY Wal > A= nylu- (b= 1— Iu > (X= In = (k= 1 Inu > 0. 
j>l41 
We may therefore choose a vertex v € U},, so that if we let Uj*! =T(v) NU} 


then |Us | > (A—7)'*1u for 14+ 1 <j <k. In this way we may proceed to 
find each of the vertices v1,..., Uk. 


We define a coloured partition m to be an edge colouring of the complete 
graph on the vertex set [m] which uses four colours, namely grey, red, green 
and blue. The number m is called the order of 7 and is denoted by ||. We 
associate with 7 two graphs, F, and G,, both on the vertex set [ml]; the edge 
set of F, is {ij : ij is grey} and the edge set of G, is {ij : 77 is green}. 

Given a graph G and constants 0 < A,7 < 1, we say that G satisfies the 
coloured partition 7 with respect to \ and n, if there is a partition of the 
vertices of G into |m| classes Vj,...,Vjz|, with [Vil < |V2] < ... < |Viaj| < 
|Vi| +1, such that the pair (V;,V;) is not n-uniform only if ij is grey, and 
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otherwise 0 < d(Vi,V;) < A, A < d(Vi, Vj) < 1-—Aor1l—-A< d(Vi,V;) <1 
according as 77 is red, green or blue. Szemerédi’s Uniformity Lemma asserts 
that, given \, 7 and some integer 1, there exists an integer L = L(1,7) such 
that any graph G satisfies some coloured partitition 7 with respect to A and 
n, with | < |z| < Land e(F,) < n('l). 

The following theorem shows that a graph satisfying 7 will be Cx(r, s)- 
universal if the size of G, is large. We shall use this result only to prove 
Theorem 3, but we state it because it may be useful in further investigations of 
hereditary graph properties. The dependencies of some of the parameters on 
other ones gives the theorem a technical appearance, but these dependencies 
are likely to be crucial in applications. In this theorem and the next one, we 
will use r—1 where in Theorem 4 we used r. Despite the potential confusion, 
we adopt this usage because it seems more natural here. 


Theorem 2. Let r,k € N, r > 2,€ > 0 and0 < X < 1 be given. Then 
there exist positive constants l, = l(r,k,e) and m = mi(r,k,¢,A) with the 
following property. Let 0 < » < m, and let 7 be a coloured partition with 
|r| >, e(F,) < n('a!) and 


e(Gr) > (1 2 —_ +e) a 


Then there is an integer s = s(7),O<s <1, such that every graph of order 
n> |n| that satisfies 7 with respect to X and 7 is Cy(r, s)-universal. 


Proof. Let t be the Ramsey number R(k), let 1, = no(r —1,t, €) and let m = 
min{d(r — 1,t,¢),k~!A*—-!}, where no and 6 are the functions appearing in 
Theorem 1. We shall show that these functions have the required properties. 

By Theorem 1 there is an F;,-avoiding copy of K,(t) in G,,. Consider one 
of the r vertex classes of this K;,(t), say T where |T| = t. The edges ij, where 
{i,j} C T, are coloured either red, green or blue. Let us recolour orange the 
edges coloured red or green. By the definition of t, there is a subset T’ C T, 
|T’| = k, such that the edges 77 where {7,7} C T’, are either all orange or all 
blue. This argument can be applied to each class of the K,(t). Therefore our 
K,(t) in G, contains a subgraph K = K,(k) such that the edge 77 is green if 
i and j are in different classes of K, the edge 77 is blue if 7 and 7 are both in 
one of the first s classes of K, and the edge 7j is orange if 7 and 7 are both 
in one of the remaining r — s classes of kK’, where s = s(7) lies in the range 
O<s<r. 

Let H € C;(r,s), so that |H| = k and H is (r, s)-colourable. Choose an 
(r, s)-colouring of H and label the vertex set of H by 21,...,2, so that the 
first k, vertices have colour 1, the next kz vertices have colour 2 and so on. 
Now construct a subgraph K’ of Kk by choosing k; vertices from the first 
class of K, kz vertices from the second class, and so on. Thus |A’| = k. Let 
G be a graph of order n > |z| that satisfies 7 with respect to A and 7, and let 
G’ be the subgraph of G induced by U{V; : i € V(K")}. The definition of 
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the colouring and the choice of K’ mean that Lemma 2 can be applied to H 
and G’. This shows that H is an induced subgraph of G’ and so also of G. 


We are now able to prove the main theorem of this section, which shows 
that any suitably large collection of graphs will have a C,(r, s)-universal 
member. 


Theorem 3. Let r,k € N, r > 2 ande > 0 be given. Then there exists 
nm, = mi(r,k,e) such that ifn > ni and P” is a collection of at least 
2t-1/(r-1)+ (8) Jabelled graphs with verter set [n] = {1,...,n}, then P” 
contains a C,(r, s)-universal graph for some s,0<s<r. 


Proof. We begin by choosing constants A, | and 7 as follows. Choose 0 < 
X < 1/4 small enough so that (e/A)* < 2°/!°. Now choose 0 < 7 < 
min{e/8,(r, k, €/8, A)} and an integer / larger than max{20/e, l1(r, k, €/8)}, 
where the functions 7, and J, are those appearing in Theorem 2. 

Apply Szemerédi’s Uniformity Lemma, with the usual parameters | and 
n, to each graph G in P”, thus obtaining, for each such graph, a coloured 
partition o(G) which G satisfies with respect to A and 7, with | < |o(G)| < 
L = L(l,n) and e(Foa)) < (leh), Since there are at most 4(2) coloured 
partitions of order less than L and at most n” ways to split the set [n] 
into fewer than L parts, there is some coloured partition 7 satisfied by at 
least n~£4-(2)Q0-V/(r-1)+0)(3) graphs of P”, which number being at least 
g-1/(r-1)+¢/2)(3) if n is large. From now on we consider only this particular 
coloured partition a which is satisfied by many graphs. We shall show that 
any graph in P” satisfying a is Cz(r, s)-universal, where s = s(m), provided 
n is both larger than L and is large enough to ensure the validity of the 
estimates below. This assumption on the size of n will be made without 
further mention throughout the remainder of the proof. 

Our aim is now to show that the coloured partition 7 must have many 
green edges. This can be done by estimating the number of graphs which can 
satisfy the coloured partition 7. Within such a graph there are at most 9(2) 
possible distributions of edges inside a vertex class V;, where N = [n/|z|]. 
Between two classes V; and V; the possible distributions of edges of the graph 
number at most 2%” for grey and green edges 77 of 7, and at most — () 
for red and blue edges. Let f be the number of grey and green edges in the 
coloured partition 7. Then the number of graphs which may satisfy 7 with 
respect to A and 7 is at most 


are < [So MY] ae 


i=0 
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Each of the first two factors here is bounded above by 215 (3). the first because 


mi(3) < pil) 02) 


and the second because 


Ey BOS? << Gy" «ave 
i=0 


It follows that the number of graphs satisfying the coloured partition 7 is at 
e(n 2 n 

most 25(2)+£N , and since we have at least 2t-I/(r-1+e/2)(3) such graphs 

we see that 


1 3e\ (n\ 1 €\ (|r| 
p> (1-4 +3) (5) > (1-5 +5) (19). 


Now f = e(F,) + e(G,), and Szemerédi’s lemma asserts that e(F,) < 
n('2!). Since 7 < €/8, we see that e(G,,) > (1—1/(r—1) +e/8)('!). Theorem 2 
now implies that any graph of order n which satisfies 7 is C,(r, s(7))-universal, 
as claimed. 


4. Hereditary Properties 


Recall that for a non-trivial hereditary property P of graphs the colouring 
number r(P) is defined by 


r(P) =max{r : there exists 0 < s <r such that P D U Cx (r, s)}. 
k>1 


We can now state and prove the main result of this paper. 


Theorem 4. Let P be a non-trivial hereditary property of graphs and let P” 
be the set of graphs in P with vertex set [n| = {1,...,n}. Set |P”| = gen (2), 
Then 


Jim, Cn =1-—1/r(P), 
where r(P) is the colouring number of P. 


Proof. Let r = r(P). We saw in the introduction that liminfy. cn > 
1 — 1/r. Suppose that the assertion of the theorem is false, so that 
limsup,_,o,Cn > 1-—1/r, which is to say there exists « > 0 such that 


|P"| > g(t-1/r+e)(3) for infinitely many values of n. It follows from Theorem 3 
that for each integer k there is an integer s, 0 < s <r+1, such that for some 
n the set C,(r + 1,8) is contained in P” and therefore in P. Consequently, 
for some value of s, we have C,(r +1,s) C P for infinitely many k, and 
hence for all &. But this contradicts the definition of r(P), so our theorem is 
proved. 
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As an example, let P be the property consisting of planar graphs and let 
P’ be those graphs containing no induced subgraph isomorphic to either Ks 
or K3.3. Clearly P C P’. Since Ks is (2, s)-colourable for s = 1,2 and K33 is 
(2,0)-colourable, it follows that r(P) = r(P’) = 1, so |P’| = 20”), 

For a slightly less simple example, let P; be the (monotone) property of 
containing no complete graph 4, and let P2 be the property of containing no 
induced subgraph isomorphic to the 7-cycle C7. Now let P be P1 NP3; that is, 
let P be the property of having no induced subgraph isomorphic to either Ky 
or C7. Each of K4 and C7 is (4, s)-colourable for every s, 0 < s < 4. Now Ky 
is (3, s)-colourable for 1 < s < 3 but is not (3, 0)-colourable, so r(P1) = 3. On 
the other hand, C7 is (3, 0)-colourable but has no (3, s)-colouring if s = 2,3, 
so r(P2) = 3. Finally, neither K4 nor C7 has a (2,0)-colouring so r(P) = 2. 
Consequently |P?| = 2”’/3 and |P2| ~ 2”’/3, whereas |P"| = 2”"/4. 


5. Additional Remarks for the Second Edition 


The preceding text is (apart from one or two updated references) the same 
as that in the original edition of this book. But the subject has moved on 
since the paper first appeared: we give here just some pointers to further 
developments. 

For smaller hereditary graph properties, namely those for which r(P) = 1, 
the theorem here gives little information. The sizes of such properties, as 
well as the sizes of hereditary properties for numerous other structures (two 
examples being oriented graphs and permutations), have been extensively 
investigated by many people, including Balogh, Morris and the first author. 
There is by now an enormous literature: for more, the reader is advised to 
begin with the survey [4]. 

The size of the hereditary property P is determined by the probability 
that a random graph G(n,1/2) has property P, and the corresponding 
probability in the model G(n, p) has been studied by us in [8]. In [16] Marchant 
and the second author show how these probabilities can be calculated more 
readily because of their relationship to extremal functions for weighted 
multigraphs, as developed in [15] and surveyed in [28]. 

Finally, we remark that a different approach to several of these results, 
avoiding the use of Szemerédi’s Lemma, was found by Saxton and the second 
author [25]. 
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Summary. Let G be a triangle-free graph of order n and minimum degree 6 > 
n/3. We will determine all lengths of cycles occurring in G. In particular, the length 
of a longest cycle or path in G is exactly the value admitted by the independence 
number of G. This value can be computed in time O(n?) using the matching 
algorithm of Micali and Vazirani. An easy consequence is the observation that 
triangle-free non-bipartite graphs with 6 > an are hamiltonian. 


1. Introduction 


In recent years a lot of research was performed on sufficient conditions for 
cycles in graphs which do not contain certain graphs as induced subgraphs. 
Mostly one of the forbidden subgraphs is the claw Ky 3. 

A completely different class of graphs are triangle-free graphs, i.e. graphs 
containing no K’3 (which is always an induced subgraph). Note that no graph 
with maximum degree more than 2 is claw-free and triangle-free at the same 
time. Most attention focused on a very special class of triangle-free graphs, 
namely bipartite graphs. Our main objective are cycle lengths in triangle-free 
non-bipartite graphs. 

Haggkvist [14] proved that computing the circumference (i.e. the length 
of a longest cycle) in general graphs of order n with minimum degree 
6 > ($—€)n is NP-hard for every « > 0. Note that for ¢ = 0 the 
circumference is n by Dirac’s celebrated result [12]. It seems even difficult to 
determine precisely the circumference or the length of a longest path in terms 
of other invariants of the graph (whose determination might be NP-hard 
themselves). 

The situation changes if we consider triangle-free graphs. The main 
observation in this paper is that in triangle-free graphs with minimum 
degree 6 > n/3 the circumference is exactly min{n,2(n — a)} and the 
length of a longest path is min{n — 1,2(n — a)} where a denotes the 
independence number of the graph, and both values can be computed in 
polynomial time. A simple derivation of the circumference result is that 
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triangle-free non-bipartite graphs with minimum degree 6 > 3n/8 are 
hamiltonian. 

Moreover, we obtain that triangle-free non-bipartite graphs with 6 > n/3 
are weakly pancyclic, i.e. they contain all cycles between their girth (length 
of a shortest cycle) and circumference. This settles the triangle-free part of a 
result of Brandt, Faudree and Goddard (see [7]) who prove that every graph 
with minimum degree 6 > (n + 2)/3 is weakly pancyclic or bipartite. 

For 2-connected graphs with an odd cycle of length at most 5, Brandt, 
Faudree and Goddard [8] proved that even the degree condition 6 > n/4+¢ 
for a moderately large constant c suffices to ensure that the graph is weakly 
pancyclic. 

Very recently, substantial progress was obtained concerning cycles in 
triangle-free non-bipartite graphs. Dingjun Lou [15] proved that triangle-free 
non-bipartite graphs # Cs satisfying the famous Chvatal-Erdés condition for 
hamiltonian graphs [11] are weakly pancyclic with girth 4 and circumference 
n, thereby answering a conjecture of Amar et al. [3] in the affirmative. 
Bauer, van den Heuvel and Schmeichel [5] proved that there are triangle- 
free graphs with arbitrary large toughness which contradicts a conjecture of 
Chvatal [10] saying that there is a constant to such that to-tough graphs 
are pancyclic. Subsequently Alon [1] determined graphs with arbitrary large 
girth and toughness. 

Since there are still extensive variations in the notation of standard 
invariants of a graph G = (V(G), E(G)) in graph-theoretical literature we 
give a brief collection of the notation used in this paper: 


a(G) (vertex-)independence number of G, 

K(G) (vertex-)connectivity of G, 

v(G) edge-independence number (or matching number) of G, 
w(G) number of components of G, 

6(G) minimum degree of G, 

A(G) maximum degree of G, 

c(G) circumference of G, i.e. the length of a longest cycle, 
p(G) length (i.e. number of edges) of a longest path of G, 


If there are no ambiguities we frequently omit the explicit reference to the 
graph, by simply writing a, «, v, etc. For the meaning of these parameters 
we refer the reader to introductory graph theory literature, e.g. [9]. As usual 
|G| denotes the order of G. Moreover, for a subset S C V(G) we denote 
the induced subgraph of V(G) \ S by G—S, and for v € V(G) we denote 
the number of neighbors of v in S$ by d(v, S$’). If H is a fixed subgraph of G 
we briefly write G — H and d(v, H) instead of G — V(H) and d(v, V(#)), 
respectively. 

The toughness of G # K,, is 


7(G) := min |S|/w(G—S) 
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where the minimum is taken over all separating vertex sets, i.e. sets S C V(G) 
with w(G—S) > 1. A graph is called t-tough if 7 > t. Cycles and paths on 
k vertices are denoted by Cy (Px, resp.). 

Finally, for a graph G and a fixed subgraph H we say that H is matching 
compatible, if v((G — H) > v(G) — |$|H|]. Note that for graphs G with a 
perfect matching there are only even order subgraphs which are matching 
compatible. Matching compatible paths and cycles in bipartite graphs and 
digraphs were considered by Amar and Manoussakis in [4]. 


2. Main Results 


Theorem 1. Let G be a triangle-free non-bipartite graph of order n and 
independence number a. If the minimum degree 6 > n/3 then G is weakly 
pancyclic with c(G) = min{n, 2(n — a)} and girth 4 unless G = Cs. 


Theorem 2. Let G be a triangle-free graph of order n and independence 
number a. If 6 > n/3 then p(G) = min{n — 1, 2(n-—a)}. 


Consider the graphs G(r) and G(r) obtained from Cs by replacing four 
vertices by a set of r independent vertices and the fifth vertex by 2r (2r +1, 
resp.) independent vertices and by joining two sets by a complete bipartite 
graph whenever the original vertices in Cs were adjacent (see Fig. 1). 


are+1 arte 


Fig. 1 The graphs Gi(r), Go(r) and G(r). 


The graph G(r) has minimum degree 6 = n/3 and independence number 
a = n/2 but no hamiltonian cycle, thus Theorem 1 is best possible. Moreover 
the Petersen graph has 6 = (n—1)/3 and a = 4 but it contains only cycles of 
lengths 5, 6, 8, 9, so it is not weakly pancyclic. The graph G2(r) shows that 
Theorem 2 is best possible, since it has 6 = (n— 1)/3 and a = (n+ 1)/2 but 
no hamiltonian path. 

As an easy consequence of Theorem 1 we obtain a degree bound for 
triangle-free non-bipartite graphs to be hamiltonian. 


Theorem 3. Let G be a triangle-free non-bipartite graph of order n. If 6 > 
3n/8 then G is hamiltonian. 
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Consider the graph G'3(r) obtained from Cs by replacing two consecutive 
vertices of Cs by r independent vertices and the remaining three vertices by 
2r +1 vertices each and by joining sets in the same way as for G(r) and 
G2(r) (see Fig. 1). This graph has 6 = (3n — 1)/8 but is non-hamiltonian, 
thus Theorem 3 is best possible. 

There is a related result for bipartite graphs involving smaller degree 
constraints, which is easily obtained by combining results from [2] and [4]. 
We will denote the graph consisting of two complete balanced bipartite graphs 
K,., intersecting in one vertex by H(r). 


Theorem 4. Let G be a bipartite graph of order n > 3. If 6 > n/4 then 
c(G) = 2(n— a) and G contains all even length cycles between 4 and c(G) 
unless G = Co, H(r). 


Now, by combining Theorem 1 with Theorem 4 we immediately obtain 
the following Corollary. 


Corollary 1. Let G be a triangle-free graph of order n > 3. If 6 > n/3 then 
c(G) = min{n, 2(n — a)} and G contains all even length cycles between 4 
and c(G) unless G = Cs. 


As a consequence of this observation we will show that the circumference 
of such graphs can be computed in polynomial time. 


Theorem 5. The circumference of every triangle-free graph of order n with 
5 > n/3 can be computed in time O(n?:°). 


This result is based on the matching algorithm of Micali and Vazirani 
[16]. H. J. Veldman [19] proved that computing the circumference in the 
class of bipartite graphs with 6 > (4 —¢) n is NP-hard for every « > 0 by a 
variation of Haggkvist’s simple construction for general graphs [14]. It would 
be interesting to know whether there is a constant c < 1/3 such that the 
determination of the circumference in triangle-free graphs with 6 > cn is still 
polynomial. 


3. Matchings and Independence Number 


The number of edges in a maximum matching v(G) of any graph G of order 
n is bounded by t 
v(G) < min{|3n], n—a(G)}. (3.1) 
Our first step will be to show that for triangle-free graphs with 6(G) > n/3 
equality holds in (3.1). 
For the further results in this paragraph it is helpful to observe the 
following sequence of inequalities for a triangle-free graph G: 


a(G) > A(G) > 6(G) > K(G). 
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The second and third inequality hold in arbitrary graphs. The following result 
ensures that for triangle-free Paths with 6 > n/3 the third inequality is an 
equality. 

Lemma 1. Let G be a triangle-free graph. Then K(G) > min{d(G), 
46(G) — n}. In particular, if 6(G) > n/3 then K(G) = 6(G). 

Proof. If «(G) = 6(G) there is nothing to prove. So assume K(G) < 6(G) —1 
and let S be a disconnecting set of cardinality «(G) and G; and G2 component 
of G—S. Let vu; € V(G;), i = 1,2. Since K(G) < 6(G), v; has a neighbor w; € 
V(G;). Since G is triangle-free v; and w; have no common neighbor. So from 

45(G) < S5 (d(v;) + d(wy)) < |Gi| + |@2| + 215] < n+ K(G) 
i=1,2 


we derive K(G) > 46(G) — n. 


In the next result it will be shown that for triangle-free graphs with 
6 > n/3 which are not 1-tough the complements of maximum independent 
sets determine the toughness. 


Theorem 6. Let G be a triangle-free graph with 6 > n/3 and toughness 
T <6/(n—26+4+1). Then tT = (n—a)/a. 


Proof. Obviously + < (n — a)/a so it remains to prove “>”. Suppose S' is a 
separating set with 7 = |.S|/w(G—S) < (n—a)/a. Then G— S is not the 
empty graph, so there is an edge vw in G— S. By Lemma 1 we have 


|S| > K(G) = 6(G), 


and since all neighbors of v and w are distinct, and lie in the same component 
of G — S or in S we get 


w(G-—S)<n-26+4+1 
thus +r = |$'|/w(G— S) > 6/(n—26 +1), a contradiction. 


Note that Gi(r) is a graph with 6 = n/3 and a = n/2 which is 
not 1-tough, So its toughness is not determined by the complements of 
maximum independent sets. While 1-toughness is a necessary condition for 
being hamiltonian it is a sufficient condition for even order graphs to contain 
a perfect matching. This follows from a famous result of Tutte [18] saying 
that a graph contains a perfect matching if and only if for every subset S 
of the vertex set o(G — S) < |S| holds, where o(G — S) denotes the number 
of odd order components of G — S. We will need the defect version of this 
result, which was first discovered by Berge [6]. 


Theorem 7. The maximum number of independent edges of a graph G is 


WG) 1 2 (CASAS, 


az scv(Q) 2 
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Now the main result of this paragraph is an easy consequence of the two 
previous results: 


Theorem 8. If G is triangle-free with 6 >n/3 then 
v(G) = min{| $n], n— a}. 


Proof. By (3.1) it suffices to show v(G) > min{| $n], n—a}. If the toughness 
T > 6/(n — 26 +1) then for every S we have 


w(G — S) < || (gn+1) /zn=|S| +3]S|/n< |S] +2 


since |.S| < 2n/3 (otherwise w(G—S) < n/3 < |S|). So o(G—S) < w(G—S) < 
|S|+ 1 and using Theorem7 we get v(G) > (n — 1)/2, thus v(G) = [gn]. 
Otherwise using Theorem 6 we have for every S$ 


o(G— S)—|S| <w(G—S)-|S|<a-(n-—a) =2a-n 


implying v(G) > n — a again using Theorem 7. 


Paths 


We are now going to prove Theorem 2. 


Proof of Theorem 2. Clearly a longest path has at most s = 1+ min{n — 
1,2(n — a)} vertices. Moreover, by Theorem 8, v(G) = min{|5n],(n — a)}. 
First observe that G contains a path which is matching compatible, having 
odd order if v(G) < n/2. Let P = Py be a longest such path and suppose 
£<s. Let v and w be the end vertices of P. Note that the subgraph induced 
by P cannot be hamiltonian, since by Lemma 1 the graph G is connected, 
yielding a longer matching compatible path. Thus, by Ore’s Lemma [17], 
6 < min{d(v, P),d(w, P)} < (€—1)/2, implying ¢ > 2n +1. Since  < s we 
have v(G — P) > 1 so there is an edge xy of a maximum matching in G — P. 
Again we calculate d(z,G — P)+d(y,G— P) < n— & which implies 


d(x, P) + d(y, P) > 26 —(n—£ > l—n/3 > (€-1)/2 


since £ > n —1. By the maximality of @ neither v nor w can have a neighbor 
in {x, y}. So there must be two consecutive vertices in P—v—w both having 
a neighbor in {x, y}. Since G is triangle-free this contradicts the maximality 
of @. Thus @ = s so, indeed, p(G) = s — 1. 


4. Lassos 


An é-lasso Lye, k > € > 3, consists of a path x1)a%2...a%-12%, with the 
additional edge x2, and we refer to k as the length of L;,¢. A lasso is even 
or odd according to ¢ being even or odd. In our proofs 5-lassos L; 5 are 
very important, and we will always assume that the vertices of a 5-lasso are 
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labeled as in Fig. 2. Note that in graph theoretical literature the lassos are 
also called cups or lollipops. 


Fig. 2 The 5-lasso. 


Lassos and Cycles 


The only aim of this part is to discover a surprising property of triangle- 
free graphs. The existence of a subgraph L;,5, in a triangle-free graph with 
6 > n/3 implies the existence of cycles of all lengths between 5 and k—1. Here 
n/3 is best possible since the Petersen graph has 6 = (n — 1)/3 and contains 
L40,5 but does not contain C7. For the proof it is helpful to define 5-lassos 
with the additional edge x32¢ which we will denote by Lj; (see Fig. 3). 


Fig. 3 The lasso ee 


Theorem 9. Let G be a triangle-free graph containing a lasso L = Ly 5 for 
k>5. Ifd > n/3 then G contains Cy for all €,5 <€<k—1. Moreover, if L 
is a longest 5-lasso then G contains C = Cy_2 such that G—C contains an 
edge. 


Proof. We will show that the following two claims are true whenever G 
contains Dx 5: 


(1) G contains Cy_1, 
(2) G contains Ly41,5 or G contains a Cy_2 where G — Cx_2 contains an 
edge. 
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Since with L;,5 all shorter 5-lassos are contained in G the claims (1) 
and (2) prove our statement. 

For a lasso L = Ly,5 define X_ = {ilv,2i-1 € E(G)}, and X; = {t]x,;x; € 
E(G)} forl <j <k. 

First suppose that G contains a lasso L = Lz5 but no subgraph 
isomorphic to ae So 2, has at most one neighbor in {x1,22,...,25} and by 
symmetry we may assume 7,21 ¢ E(G). Moreover, xg is adjacent to neither 
“2 nor x3. If a vertex v € V(G — L) is adjacent to two vertices of x2, x3 and 
x, then it is easily seen that G contains a longer 5-lasso and a Cy_1 because 
G is triangle-free. So d(x2,G — L) + d(x3,G — L) + d(a,,G@— L) <n—k. 

Consider X = X2U X3U Xx. By the above reasoning on the neighbors 
of xg and x, and by the observation that x5 is adjacent to neither x2 nor 
x3 we get that either 5 ¢ X or 6 ¢ X thus |X| < k—1. On the other hand 
we have 


d(x2, L) + d(a3, L) + d(xz, L) > 36 -(n-—k) > k > |X|. 


Thus there is an index i such that i € X, andi € X; for j € {2,3} since G is 
triangle-free. So, again, it is easily verified that G contains C,_; and a Ch_2 
with an edge in G — Cy_2. 

Now assume Lt = Lis C G. For k = 6 the lasso Lés contains C's and 
a Cy with an edge in G — ven so assume k > 7. Consider this time 71, r2 
and x,. By the same reasoning as above we are done if there is a vertex in 
G — L* adjacent to two of them. Since G is triangle-free_ X; M X2 = @ and 
X,U Xe C {1,2,3,5,7,8,...,&}. Moreover, we are done if x, is adjacent to 
one vertex of 21, £2, £3 Or 25, SO we may assume X, C {5,7,8,...,k}. Thus 
we have X = X,UX2UX, C {1, 2,3,5,7,8,...,k}, in particular |X| < k—2. 
Again we calculate 


d(z1, Lt) +d(xo, L+)+d(a,, Lt) > 36-(n—k) > |X|4+2 


so X,; and X; U X_ must intersect in an index i 4 5, yielding a C,_,; anda 
Cr—2 with an edge in G — Ch_2. 


Lassos and Matchings 


The previous observation suggests looking for long 5-lassos in triangle-free 
graphs in order to obtain many cycles at once. We will now examine how to 
determine the length of a longest 5-lasso in such a graph. The starting point 
will be a powerful result relating the length of a longest path to the length 
of a longest cycle. 


Theorem 10 (Enomoto, van den Heuvel, Kaneko, Saito [13]). Let 
G be a graph of order n. If every set of three independent vertices u, v, w 
satisfies d(u)+d(v)+d(w) > n then c(G) > p(G) or G is a spanning subgraph 
of one of six exceptional families of graphs each of which has connectivity 
K<n/3. 
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For our purposes it suffices to replace the degree sum condition by 
5 > n/3. Note that the condition c(G) > p(G) for connected graphs implies 
equality unless G is hamiltonian where c(G) = p(G) +1. 


Lemma 2. If G is a triangle-free non-bipartite graph of order n > 6 with 
6 >n/3 then G contains a lasso L = Ly,5 which is matching compatible and 
where k is odd if n is odd and a < n/2. 


Proof. First assume a > n/2. Take an independent set A, |A] = a, and a 
maximum matching M, |M|=n-—a, which exists by Theorem 8. Note that 
M saturates every vertex of G— A. Since G is not bipartite G— A contains 
an edge vw. Moreover, because 26 > n — a the matching neighbors v’ and 
w’ of v and w, respectively, must have a common neighbor u € G— A. Now 
the graph spanned by the set {u,u’,v,v’,w,w’} where u’ is the matching 
neighbor of u contains a lasso L = Le 5 and G— L contains a matching on 
vy — 3 edges. 

For a < n/2 we will first show that G contains an odd lasso Ly ¢ where 
G—Ly,¢ has a perfect matching. From Theorem 2 we obtain that p(G) > n—1. 
Since G satisfies the hypothesis of Theorem 10 and using Lemma 1 we infer 
that G cannot be an exceptional graph, so c(G) > n — 1. If c(G) = n then 
every hamiltonian cycle has an even length chord since G is non-bipartite. 
This yields a spanning odd lasso L,, ¢. If c(G) =n — 1 then we have Dyn-1 
and we are done if n — 1 is odd. Otherwise recall the standard labeling of the 
lasso vertices. If x, is adjacent to an odd labeled vertex xe then we obtain 
Ln 242. If tp, is adjacent only to even labeled vertices then since a < n/2 
two even labeled vertices must be adjacent. So we get an even length chord 
splitting the cycle into an odd cycle where x, has a neighbor and an odd 
length path. So we get a perfect matching from the path and an odd lasso. 

Take a smallest order odd lasso L = Ly ¢ where G — L has a perfect 
matching. Such a lasso exists by the above reasoning. Clearly we may assume 
k < £+1 since otherwise we add the edge 7,2,_1 to the matching. Note that 
the cycle of the lasso is an induced cycle in G since every chord yields a 
shorter odd lasso and a perfect matching. If 2 = 5 we are done so suppose 
£> 7. Now neither x3 nor x4 can be adjacent to x41 so for X = {x3, 24, 20} 
we calculate 


S° d(x,G — L) > 35-6-(k-0) >n—k. 
rex 
So one of the (n — k)/2 matching edges in G — L has at least 3 neighbors in 


X. It is now easily checked that in all cases we obtain a shorter odd lasso L’ 
where G — L’ has a perfect matching. 


It should be mentioned that there is a direct way of proving Lemma 2 
without using the powerful Theorem 10, at the expense of some tedious case 
analysis. However this might be of interest for giving an algorithm which 
actually finds cycles of given length in triangle-free graphs with 6 > n/3. 
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Proposition 1. Every triangle-free non-bipartite graph G with 6 > n/3 
contains a lasso Ls5 where s = min{n, 2(n —a)}. 


Proof. Clearly every 5-lasso in G can have at most s vertices since at most 
half of its vertices form an independent set. Let r be the length of a longest 
lasso L = L,5 contained in G which is matching compatible and which has 
odd order if a < n/2 and n is odd. By Lemma 2 such a lasso exists and it 
is easily observed that L has even order if a > n/2 and order of the same 
parity as the order of the graph otherwise. Consider the end vertex x, of the 
lasso. If x, has a neighbor in G — L then the maximality requirement for r 
is obviously violated. So from (n + 1)/3 < d(z,,L) < (r — 1)/2 we obtain 
r > (2n+5)/3. 

Now consider a matching edge vw in G — L. First observe that v and 
w have together at most 4 neighbors in {21,...,76}, since if one vertex, 
say v, is adjacent to x6 the other vertex w cannot be adjacent to a vertex 
with a smaller index without enlarging the length of the lasso by two. Since 
d(v,G— L)+d(w,G — L) < n—r we obtain 


d(v, L) + d(w, L) > (2n+ 2)/8-—n+r=r-— (n—2)/3 > (r+3)/2. 
Thus 
d(v, {x7,...,tr}) + d(w, {a7,...,¢r}) > (r —5)/2. 


Since G is triangle-free, and v and w are both non-adjacent to x, they must be 
adjacent to two consecutive vertices on the path rgv7...x¢,-—1, contradicting 
the maximality of the lasso. So we conclude that the lasso must have at least 
2u(G) vertices and one vertex more if a < n/2 and n is odd. Now Theorem 8 
completes the proof. 


5. Proofs of the Main Results 


Non-bipartite Graphs 


In order to carry out the proofs of Theorems 1 and 3 we need a better bound 
for the independence number than the obvious bound a < n — 6 holding for 
arbitrary graphs. 


Lemma 3. For any triangle-free non-bipartite graph the inequality a < 
min{2(n — 26),n—6 — 1} holds. 


Proof. Let A be an independent set on a vertices. Since G is non-bipartite 
G — A contains an edge xy. Since no vertex from A is adjacent to both x 
and y we get a+ 6 <n. implying a < n—6-—1. Now assume d(x,G — A) > 
d(y,G — A). Since every neighbor of z in A has all its neighbor in G— A 
we have d(a,G — A) < (n— a) — 6. Furthermore x and y have no common 
neighbor in A. Thus 
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26 < d(x) + d(y) < 2(n—a— 6) +A], 


implying a < 2(n — 26). 


We are now prepared to prove the main result. 


Proof of Theorem 1. First note that there is no graph of order n < 8 
satisfying the hypothesis. By Proposition 1 the graph G contains a lasso 
L=L, 5 where r = min{n, 2(n—a)}. In particular G contains C5. A simple 
counting argument proves that there must be a vertex in G— Cs having two 
neighbors in C5, hence G contains C4. 

Now consider L. Theorem 9 implies that G contains all cycles of lengths 
5<€<r—landacycle C = C,_2 where G—C contains an edge vw. Using 
Lemma 3 we get a < (2n — 4)/3, thus r > (2n + 8)/3. Since d(v,G — C) + 
d(w,G—C)<n—r+2 we get 


d(v, C) + d(w,C) > 26 - (n—r+4+2) > |C|/2. 


Thus there must be two consecutive vertices on the cycle each having a 
neighbor in vw. Since G is triangle-free this provides the missing C’.. 


ow 


Proof of Theorem 8. This is immediate from Theorem 1 since by Lemma 
we obtain a(G) < n/2. 


Bipartite Graphs 


The following result of Amar and Manoussakis [4, Corollary 4] settles the 
circumference part of Theorem 4. 


Theorem 11. Let G be bipartite with bipartition AU B, |A| < |BI. If 6 > 
|A|/2 then c(G) = 2|A| or G= H(r). 


Now the following local condition of Amar [2] provides all the even length 
cycles but the 4-cycle (actually Amar’s result gives more information than 
we mention here). 


Theorem 12. Suppose C is a hamiltonian cycle in a bipartite graph G and 
x and xt* are vertices at distance 2 in C such that the neighborhoods of x 
and «tt are not identical. If d(x) + d(x**) > (n+1)/2 then G contains all 
even length cycles between 6 and n. 


Proof of Theorem 4. For a bipartition AU B, |A| < |B|, fix a cycle C of 
length 2|A| which is guaranteed by Theorem 11 and consider the subgraph 
of G induced by C. This graph contains every vertex of A. If |A] = 2 the 
conclusion is certainly true so assume |A| > 3. Thus |BMC| > 3 and it 
is easily calculated that among any three vertices of BM C there is a pair 
whose neighborhoods intersect in at least two vertices (unless G = Cg), so 
G contains C4. If the neighborhoods of all vertices in BMC, are identical 
then the subgraph induced by C is complete balanced bipartite otherwise 
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there are two vertices in BMC at distance 2 in C satisfying the hypothesis 
of Theorem 12. So in any case G' contains cycles from 4 through 2|A]|. Since 
B is an independent set 2|A| > 2(n — a). 


Computing the Circumference 


Proof of Theorem 5. Determine a maximum matching M in O(n?°) time 
using the algorithm of Micali and Vazirani [16]. By Corollary 1 and Theorem 8 
we have c(G) = 2v(G) unless v(G) = (n—1)/2 and a(G) F¥ (n+1)/2, in which 
case c(G) = n. So if vy = (n — 1)/2 select a set S’ of vertices as follows: take 
the vertex outside the matching and the matching neighbors of its neighbors. 
Define S as the set of vertices having no neighbor in S’. Since |S’| > 6+1 and 
|N(v) UN(w)| > 26 for every edge vw, at most one vertex of every matching 
edge is in S. It is easily checked that a = (n + 1)/2 if and only if S is an 
independent set of cardinality (n + 1)/2. Finally observe that computing S 
and checking its independence can be done in a straightforward way in O(n?) 
time, once the matching is given. 
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Summary. This is a summary of problems and results coming out of the 20 year 
collaboration between Paul Erdés and authors. 


1. Introduction (by Paul Erdés) 


Over a 20 year period, my friends at Memphis State University and I have 
done a great deal of work in various branches of combinatorics and graph 
theory. We have published more than 40 papers and posed many problems. 
Here I only mention two examples. We wrote the first paper on the size 
Ramsey number [35] and stated many problems and conjectures in this 
subject. Some of our questions have been answered by Beck [3] but many 
of them are still open, and the subject is very much alive. We also proved 
the conjecture of Hajnal and myself concerning monochromatic domination 
in edge colored complete graphs. 

During my present visit, many new problems were raised. I state only one. 
Let f(n,k) be the smallest integer for which there is a graph with k vertices 
and f(n,k) edges in which every set of n+2 vertices induces a subgraph with 
maximum degree at least n. This problem was raised in trying to settle an 
old conjecture of ours: is it true that for every m > 2n every graph with m 
vertices and os ") — (5) — 1 edges is the union of a bipartite graph and a 
graph every vertex of which has degree less than n. Faudree has a very nice 
proof for m = 2n+ 1. We tried to prove it for m = 2n + 2 and this led us to 
our problem. 

I hope that if I live we will have many more new problems and results. 
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2. Generalized Ramsey Theory 


2.1. Terminology and Notation 


Let G1, G2,...,G, be a sequence of graphs, each having no isolated vertices. 
The Ramsey number r(G1,G2,...,Gs) is the smallest integer r so that in 
every k-coloring of the edges of K;,, there is a monochromatic copy of G; 
in the 7th color for at least one i. Most of the known results concern the 
two-color case. In case k = 2 and G; = Gp = G, we denote the Ramsey 
number by r(G). Throughout this paper, the notation will generally follow 
that of the well-known text by Chartrand and Lesniak [21]. In particular, the 
graph theoretic functions 6(G), A(G), 6(G) and x(G) are used to represent 
the minimum degree, maximum degree, independence number, and chromatic 
number, respectively, of the graph G. 


2.2. Cycle Versus Complete Graph 


The cycle-complete graph Ramsey number r(C,,, K,,) was first studied by 
Bondy and Erdés in [5]. They obtained the exact value in case m is large in 
comparison with n. 


Theorem 1 ([5]). Form >n?— 2, 
r(Cm, Kn) = (m—1)(n—1) +1. 
Question 1 ([34]). What is the smallest value of m such that 
r(Cm, Kn) = (m—1)(n—1)41? 
It is possible that r(Cm, Kn) = (m—1)(n—1)+4+1 for all m>n. 


Additional results concerning cycle-complete graph Ramsey numbers were 
obtained in [34], among them the following upper bound. 


Theorem 2 ([34]). For allm > 3 and n> 2, 
t(Cms Kn) < {(m — 2)(n¥* + 2) + 1}(n— 1), 
where k = |(m—1)/2]. 
For m fixed and n > oo, the probabilistic method [57] yields 


(m—1)/(m—2) 


Cm, Kn) > ¢ | —— 
zc jee (= 
The case of m = 3 is well studied. By the probability result, r(C3, K,) > 
c(n/logn)?, and in [56] Shearer improves the technique of Ajtai, Komlés and 
Szemerédi [1] 


r(C3, Kn) < Peon 
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The case of m = 4 is comparatively open. As above, the probability method 
yields 
n \3/? 
r(C4, Kn) >C (<.) (n — oo). 


In [34] we proved that r(C4, K,,) = 0(n”), and an unpublished observation of 
Szemerédi yields 


(C4, Kn) <e (2) - (2) 


It is expected that 
r(C4, Ky) = 


n00 1(03,Kn) 


but the known bounds [(1) with m = 3 and (2)] just fail to produce such a 
result. 


Conjecture 1. There exist an €¢ > 0 such that r(C4, Kn) < n?~* for all 
sufficiently large n. 


From Theorem 2 and (1) it follows that if m is fixed, 
r(Cyas Kn) > r(Com—1, Kn) and r(Cm, Kn) = r(Com; Kn), 


for all sufficiently large n. In view of this, it may be that for n appropriately 
large, r(C,, K,) first decreases as m increases. In view of Theorem 1, we 
know that r(C,,, K,,) ultimately increases with m. It may be that there is a 
unique value of m at which the minimum is attained. 


Question 2 ([34]). For n fixed, where is the minimum value of r(Cm, Kn) 
attained ? 


Notable progress has been made on some of the cycle—complete graph 
problems. 


Answer 1 ((51]). J. H. Kim has proved that 
r(K3, Kn) = 17(C3, Kn) = O(n? /logn), nm — oo. 
Answer 2 ((53]). V. Nikiforov has proved that ifm > 4n+2 then 
r(Cm, Kn) = (m—1)(n—-1) +1. 


2.3. “Goodness” in Generalizes Ramsey Theory 


Let F be a graph with chromatic number y(F’). The chromatic surplus of 
F, denoted s(F), is the least number of vertices in a color class under any 
proper y(f)-coloring of the vertices of F. If G is any connected graph of 
order n > s(F’), then 
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WEG) 2 OF) =e 1) + a4), 
The example which establishes this bound is simple. 
Example 1. With p = (x(F) —1)(n—1)+s(F) — 1, two-color the edges of 
K,, so that the blue graph consists of x(£') disjoint complete graphs, x(F') —1 


of order n — 1 and one of order s(F) — 1. In this two-coloring, there is no 
connected blue subgraph of order n and there is no red copy of F. 


In case r(F, G) = (y(F) —1)(n—1)+8(F), we shall say that G is F-good. 
Chvatal observed that for each m and each tree T of order n, 


r(Km,T) = (m—1)(n—1) +1. 


In other words, every tree is K,,-good. In large part, the study of “goodness” 
in Ramsey theory is an attempt to see how far this simple result of Chvatal 
can be generalized. Known cases where G is F-good involve some restriction 
on the number of edges and/or maximum degree of G [15, 16, 37]. 


Theorem 3 ([38]). Given k,A and s = py < --: < pm, there exists a 
corresponding number no such that every connected graph G with n > no 
vertices, q < n+k edges and maximum degree < A is K(pi,po,-.--,Pm)- 
good. 


Corollary 1 ([38]). Let F be a fixed graph with chromatic number x and 
chromatic surplus s. Set a = 1/(2|V(F)| — 1). Then there are constants Ci 
and C2 such that for all sufficiently large n, every connected graph G of order 
n satisfying |E(G)| <n+Cyn® and A(G) < Con® is F-good. 


The edge density of a graph G is defined to be max{q(H)/p(H)}, where 
p(H) and q(H) denote the number of vertices and edges, respectively, of H, 
and the maximum is taken over all subgraphs H Cc G. As usual, A(G) denotes 
the maximum degree in G. 


Conjecture 2 ([38]). Let F be an arbitrary graph and let (G,,) be a sequence 
of connected graphs such that (i) G, is of order n, (ii) A(Gn) = o(n) and 
(iii) each graph in (G;,) has edge density at most p (a constant). Then Gy, is 
F-good for all sufficiently large n. 


Conjecture 3 ([38]). If F is an arbitrary graph and (G,) is a sequence 
of connected graphs such that A(G,,) is bounded, then G,, is F'-good for all 
sufficiently large n. 


If all trees are F' good, it is natural to ask for the largest integer q = 
f(F,n) so that every connected graph with n vertices and at most q edges is 
F-good. For F = K3 we have the following result. 


Theorem 4 ((10]). Let f(K3,n) be the largest integer q so that every 
connected graph with n vertices and at most q edges is K3-good. Then 
f(K3,n) > (17n+1)/15 for alln > 4 and f(K3,n) < (27/4+¢)n(logn)? for 
all sufficiently large n. 
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Question 3 ((10]). Does f(K3,n)/n tend to infinity with n? 


Also, it is appropriate to ask for the largest integer g = g(F,n) so that 
there exists a connected graph with n vertices and q edges that is F-good. The 
following result is known concerning f(Km,n) and g(Km,n) where m > 3. 


Theorem 5 ([10]). For each m > 3, there exist positive constants A, B,C, D 
such that 
ee 4A 
n+Anm-1 < f(Km,n) <n+ Bn™4 (logn)’, 
and 


mi m+2 (m+1)(m=2) 
Cnm-1 <9(Km,n)< Dn ™ (logn) m1) 


for all sufficiently large n. 


The example that provides the lower bound for g(K3,n) is easily 
described. (A similar example works in general to provide a lower bound 
for g(Km,n).) 


Example 2 ((10]). Choose t and s so that r(K3,tKs) < n—1. Then set 
G = Kk, +4 where H is the graph consisting of t disjoint copies of Ks 
together with n — 1 — ts isolated vertices. Then G is a connected graph with 
n vertices and t($) +n—1 edges that is K3-good. 


Are there cases where this example is best possible? 


Question 4. For what values of n (if any) is g(K3,n) the number of edges 
in the above example where s and t have been chosen so that i) is maximized 
(or possibly a generalized version in which there are t complete graphs 
Ks,,Ks,,-.-,Ks,, not all of the same order)? 


In an influential paper [6], Burr and Erdés gave six questions, the 
resolutions of which would in all likelihood require new and widely applicable 
techniques. One of these six asks “is the n-dimensional cube K3-good for all 
large n?” This question is still open. However, the remaining five are proven 
to be true in a paper by Nikiforov and Rousseau [54]. The methods of proof 
are indeed modern—a mix of the regularity lemma, embedding of sparse 
graphs, Turan type stability, and other structural techniques. They define a 
class of graphs “degenerate and crumbling” and prove that all sufficiently 
large graphs from this class are Kp-good. This large class of “degenerate and 
crumbling” graphs contains well know classes of graphs referenced in [6]. Two 
examples of a specific results from this paper is the following. 


Answer 3 ([54]). If each edge of Ky is subdivided by one vertex, then the 
resulting graph is K,p-good for p fixed and n large? 


Answer 4 ([54]). Suppose K > 1, p> 3 and T,, is a tree of order n where 
k= (ki,...,kn). If0 < ki < K for alli € [n] then TX is Ky-good for n large, 
where TX is the graph obtained from T,, by replacing vertex i with a clique 
Kp, and replacing and an edge ij with a complete bipartite graph Kx,,x,- 
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2.4. Graph Versus Tree 


If T is any tree, r(C4, T) is determined by r(C4, Ky,ar)), where A(T’) denotes 
the maximum degree in T. 


Theorem 6 ([17]). Let T be any tree of order n. Then 
r(C4,T) = max{4,n4+1,7r(C4, Ky acr))}- 


In view of the importance of r(C4, K1,,), it would be helpful to have more 
precise information concerning this Ramsey number. It is easy to prove that 


(Cas Bay) < n+ [V7] <1 1 

and in [17] we obtain 

r(Ca, Kin) > n+ Vn 6ni/?, (3) 
for all sufficiently large n. The proof of the later result uses the fact that 
for any prime p there is a C4-free graph G, of order p? + p+ 1 in which 
each vertex has degree p or p+ 1. By letting p be the smallest prime greater 
than \/n and randomly deleting a suitable number of vertices from G,, we 
obtain a graph that yields (3). The bound is based on information about the 
distribution of primes. Let p; denote the k-th prime. If has been conjectured 


that pei1— pr < (log p,)* for some constant a. If this conjecture is true, the 
lower bound in (3) can be improved to n + /n — 6 (4 logn)”. 


Question 5 ([{17]). Js it true that r(C4, Kin) <n+./n-—c holds infinitely 
often, where c is an arbitrary constant? 


Paul Erdds offers $100 for a proof (one way or the other). 
Question 6 ([{17]). Is it true that r(C4, Kin4i) < r(Cs, Kin) +2 for all n? 


In [41] it is proved that ifn > 3m —3 then r(K(1,1,m),T) = 2n-1 
for every tree T of order n. This was shown to be the case when T is a star 
(K1.n-1) by Rousseau and Sheehan [55] who also proved that the condition 
n > 3m— 3 cannot be weakened. This is one of many examples in Ramsey 
problems involving trees where the star turns out to be the worst, i.e. largest 
Ramsey number, case. How general is this phenomenon? 


Conjecture 4. If G is fixed and n is sufficiently large, r(G,T) < r(G, 
Ki n-1) for every tree T of order n. 


What are the graphs G relative to which all sufficiently large trees are 
good? An early result showed that this class of graphs includes all those in 
which some y(G)-coloring has two color classes consisting of a single vertex. 


Theorem 7 ([7]). Let G be a graph with y(G) > 2 that has a vertex x(G)- 
coloring with at least two color classes consisting of a single vertex. Then 


GD) (XG) =D) Tl 
for every tree T of order n > no(G). 
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A complete characterization for the case of s(G) = 1 has been obtained 
by Burr and Faudree [18]. 


Theorem 8 ([18]). Let G be a graph with chromatic number k. Then for all 
sufficiently large n, every tree T of order n satisfies r(G,T) = (k—1)(n—1)+1 
iff there is an integer m, such that G is a subgraph of both the chromatic 
number k graphs Kim,m 


If G = Kmy,mo,....m, With m, = 1, the fact that G has chromatic number 


k and chromatic surplus 1, gives the lower bound (k—1)(n—1)+1 and there 
is an upper bound that depends only on r(Kym,,T). 


Theorem 9 ([42]). For all sufficiently large n, every tree T of order n 
satisfies 


DoH D417 <7 hie, a) < b-— DiGi) 41. 


aces 


For m, # 1 it is natural to suspect the following generalization. 


Conjecture 5 ([42]). If mi < mz <--- < mz and T is any sufficiently 
large tree, 


IA 


Trac atin cmgsl (k _ 1)(r(Kimy,m2;T) _ 1) +m}. 


The decomposition class of a graph G, denoted B(G), is defined as follows: 
a given bipartite graph B belongs to B(G) whenever there exists a y(G) 
vertex coloring such that B is the bipartite graph induced by some pair of 
color classes. Let r(6(G),T) denote the smallest integer r so that in every 
two-coloring of E(K,.) there is at least one member of B(G) in the first color 
or a copy of T in the second color. In [40] it is shown that if B is a fixed 
bipartite graph and 71, 7>,73,... is any sequence of trees where T;, is of 
order n, then r(B,T,,) = n+ o(n) as n + oo. Thus r(B(G),T,) = nF o(n). 
The result of Faudree and Burr can be written as follows: if B(G) contains 
Ky, then for all sufficiently large n, every tree T of order n satisfies r(G,T) = 
(x(G) — 2)(n—1)+7r(B(G), T). It would be interesting to know how generally 
this formula holds. 


Question 7. What graphs G satisfy 
r(G,T) = (x(G) — 2)(n— 1) + r(B(G), T) 


for every tree T of order n > no(G)? 


2.5. Trees 


Some very interesting open problems concern the diagonal Ramsey number 
r(T) where T is a tree. It has been conjectured by Erdés and Sds that each 
graph with m vertices and at least (n — 2)m/2 edges contains every tree T’ 
of order n; if true, this yields r(T’) < 2n — 2. A general lower bound for r(T’) 
is given by the following example. 
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Example 3. Assume a < b. Construction (1): Two-color the edges of 
Koa+p—2 80 that the red graph is Ka—1 U Ka4o-1. Construction (2): Two- 
color the edges of K2p-2 so that the red graph is 2Ky_1. In each of these 
two constructions, there is no monochromatic connected bipartite graph with 
a vertices in one class and 6 vertices in the other. 


In view of this example, we see that any tree T with a vertices in one 
color class and 6 in the other satisfies 


r(T’) > max{2a + b— 1,2b— 1}. 


The broom B;, is the tree on k + | vertices obtained by identifying an 
endvertex of a path P; on / vertices with the central vertex of a star Ky, on 
k edges. 


Theorem 10 ([36]). 


(1) r( Bea) =k + [31/2] -—1 for k>1 and 1> 2k, 
(2) r(Bea) < 2k +1 for 5<1< 2k. 


From the lower bound given by Example 3, 
2k + 2|1/2| —1 when l < 2k—1, 


r(Br1) > 
2k + 2|1/2| when | = 2k — 1. 


Thus in case (2) of Theorem 10, r(B,,7) differs from the upper bound by at 
most 2. 

The smallest value of max{2a + b — 1,2b — 1} occurs when 2a = 6, so 
Example 3 shows that r(T) > [4n/3] for every tree of order n. Moreover, 
this bound cannot be improved since equality holds for certain booms. 


Question 8 ((36]). Js r(T’) = 4a for every tree with a vertices in one color 
class and 2a in the other? 


A special case of the previous question is when T is the tree obtained 
from subdividing a — 1 edges of the star Ky 24 [24]. 


2.6. Multicolor Results 


When the list of graphs (Gi, Go,...,G_-) is restricted to complete bipartite 
graphs, odd cycles, and exactly one large cycle (even or odd), there is an 
exact formula for r(G1, Go,...,G). 


Theorem 11 ([33]). Let [B] and [C] denote the following fixed sequences of 
graphs: 


[B] = (Koei. ** Koc.) bi < Gi GG = Vy2jaing 8), 
[C] = (Coa. 41,--+, Coa, 41)- 
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Further, let £= S*>_,(b; — 1), and require that d; > 2°"! for i = 1,...,t. 


Then, if n is sufficiently large, r(Cy, [B], [C]) = 2*(n + €—1). 


All of the results in [33] are for the case where exactly one cycle length 
is large in comparison with the orders of the remaining graphs ([B], [C]). It 
would be quite helpful to have corresponding results when there are two or 
more large cycles. 


Question 9 ([33]). What is r(Cn, Cm, |B], [C]) when both n and m are large 
in comparison with the orders of the remaining graphs? 


Let [Coda] = (Coa, 41;---;C2a,41)- Then r(< [Coaa]) denotes the smallest 
number r such that in every t-coloring of the edges of K,, there is an odd 
cycle of length at most 2d; + 1 in the ith color for some 7. Using the proof 
technique of Theorem 11, we have the following multicolor Ramsey result for 
cycles. 


Corollary 2 ([33]). Let [Coven] = (C2v,,---;C2v,); [Coda] = (Coa, 41;---5 
Coa,41), and let = >;_,(b; — 1). If n is sufficiently large, 


r(Ch; [Ceven], [Coaa]) = (r(< [Coaa]) — 1)(n + 2-1) +1. 


For the case of three colors, the following results were found for r(< 
[Coaa]). 
Theorem 12 ([33]). Let €>k>m. Then 
9k>4,1>2,m>1, 
T(S< (Corsi, Carz1, Cam4i)) = 
11k >5,l=m=1. 


In the remaining cases, 
T(< (Corsi, Cari, Cam4i)) = 


for (respectively) 

(3, 3, 3), (3, 3, 2), (3, 3, 1), (3, 2, 2), (2, 2, 2), 
(3, 2, 1), (2, 2, 1), 

(4, 1,1), (3,1, 1), (2,1, 1), 


b 9 


(k, l,m) = 


(1, 1,1). 


By considering a bipartite decomposition of a complete graph, it is clear 
that, in general, 


r(< (Com41) eee ,Com,+1)) > Qt +1. 


104 Ralph J. Faudree, Cecil C. Rousseau, and Richard H. Schelp 


Note that r(< (Cop41, Cais1,Com41)) = 22 +1 = 9 except for (k > 5), 
1 = m= 1 and six special cases. 


Question 10. When does r(< (Com,41,°+* ,Cam.41)) = 2° +1 hold? In 
particular, what is true for t = 4? 


2.7. Ramsey Size Linear Graphs 


A graph G is Ramsey size linear if there is a constant C' such that 
r(G,H)<Cn 


for every graph H with n edges. The following result is easily proved using 
the Erdés-Lovasz local lemma and the techniques of Spencer [57]. 


Theorem 13 ((39, 44]). Given a graph G with p vertices and q edges, there 
exists a constant C such that for n sufficiently large, 


ee 
Neen Ce 


Corollary 3. If G is a graph with p vertices and 2p—2 edges, then G is not 
Ramsey size linear. 


In the other direction, we have the following result. 


Theorem 14 ((44]). If G is any graph with p vertices and at most p+ 1 
edges, then G is Ramsey size linear. 


There are various small graphs for which the question of Ramsey size 
linearity is open. 


Question 11. Are the following graphs Ramsey size linear: K33, Hs (the 
graph obtained from Cs by adding two vertex disjoint chords), Q3 
(the 3-dimensional cube)? 


Question 12 ([44]). If G is a graph such that each subgraph on p vertices 
has at most 2p — 3 edges, then is G Ramsey size linear? 


The graph Ky, is not Ramsey size linear, but the deletion of any edge 
gives a Ramsey size linear graph. It would be of interest to find a specific 
infinite family of graphs with this property. 


2.8. Repeated Degrees and Degree Spread 


If n > r(G), and the edges of K,, are two-colored, there must be at least 
one monochromatic copy of G. If there must be such a monochromatic copy 
of G in which two vertices have the same degree in the two-colored K,,, at 
least for all sufficiently large n, then we say that G has the Ramsey repeated 
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degree property. In [2] it was shown that K,, where m > 4 does not have this 
property. 


Theorem 15 ([20]). For each m > 1, the graphs Km,m and Com+41 have the 
Ramsey repeated degree property. 


Question 13. Are there graphs other than K,, for m > 4 that fail to have 
the Ramsey repeated degree property? 


The degree spread of X C V(G) is the difference between the largest and 
smallest degree (in G) over all vertices in X. Given graphs G and H and 
a two-coloring of the edges of K, with n > r(G,H), the degree spread of 
the coloring is the minimum degree spread over all vertex sets of copies of 
G and/or H that are appropriately monochromatic. Then ®,(G,H) is the 
maximum degree spread over all two-colorings of E(K,,). 


Theorem 16 ([20]). If n > 4(r — 1)(r — 2) where r = r(G,H), then 
®,(G,H) =r(G, H) - 2. 


Question 14 ([20]). What is the smallest n for which ©,(G,H) = 
r(G, H) — 2? Does this relation hold for alln > r(G, H)? 


3. Size Ramsey Numbers 


3.1. Definitions and Notation 


Write F — (G,H) to mean that in every two-coloring of the edges of F, 
there is a copy of G in the first color or a copy of H in the second. The 
size Ramsey number of the pair (G,H), denoted 7(G,H), is the smallest 
number q = 7(G) for which there exists a graph F’ with q edges satisfying 
F — (G,#H). In case G = H we write F > G for the “arrow” relation and 
*(G) for the size Ramsey number. The size Ramsey number was introduced in 
[35]. Interestingly enough, the size Ramsey number of K,, is simply expressed 
in terms of its ordinary Ramsey number: 


(See [35].) Questions concerning 7(P,,) were raised in [35]. In [3] Beck proved 
the rather surprising fact that *(P,,) < Cn for some constant C 


3.2. Complete Bipartite Graphs 


The asymptotic behavior of the size Ramsey number of Ky», is studied in [48]. 


Theorem 17 ([48]). For alln > 6, 


2 
i 2” < F(Kyn) < 52". 
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The upper bound comes from the fact that Ka,» with a = |n?/2| and 
b = 3n2” arrows Ky» for n > 6. The lower bound comes from a simple 
application of the probabilistic method using a bound for the number of 
copies of K,,, in any graph with q edges. Of course, the immediate goal is 
to bring the bounds for #(Ky,,) to within a constant factor. 


Question 15. Is there a choice of a = a(n) and b = b(n) so that ab = 
O(n?2") and Ka» 4 Knin? 


3.3. Stars and Star-Forests 


Theorem 18 ([8]). For positive integers k,l,m, and n, 
(mK xn, ny.) = (m+n—1)(k+1-1). 


Moreover if F > (mK14,nK1,2) and F has (n+m-—1)(kK+1—1) edges, then 
F=(m+n—1)Kixge-1 ork =1=2 and F=tK3U(m+n+t-1)ki3 
for some 1<t<m-+n-1. 


This leaves open the size Ramsey number for a pair of star forests. Assume 
that 


s 
f= LJ Kin, (ny >ng2>°++>ns) and 
i=l 


t 
fy = Ses (m, > m2 >--- > mz). 
i=l 


For2<k<s+tset l, =max{nj+m,;—1:i+ j =k}. It is not difficult to 
prove that U;*$ Ki, > (Fi, F2) so 7(Fi, Fa) < ets de. 

stt 
Conjecture 6 ([8]). 7(F\, Fb) = > Ip. 

k=2 

Note that if n; = n for all i and m; = m for all j, then the conjectured 

value agrees with the number 7(sKy,,,tKi,m) = (m+n-—1)(s+t-—1) given 
in Theorem 18. 


3.4. Star Versus Complete Graph 


The following result is easily established. 


Theorem 19. The arrow relation Kon41 — Kn > (Kin, K3) holds for each 
n> 2. 


Possibly Kon41 — Ky is an arrowing graph with the minimum possible 
number of edges, and thus 7#(Ky,,, K3) = Ca) - (5) = nie To verify 
this, it would be sufficient to prove the following conjecture. 
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Conjecture 7 ([23]). For n > 3, any graph with ae — (5) — 1 edges is 
the union of a bipartite graph and a graph with maximum degree less than n. 


It is not difficult to verify this conjecture if the graph has at most 2n+ 1 
vertices. 


3.5. Graph Versus Matching 


No general result is known for 7(G,tK2), but some information about this 
number can be obtained by considering 

: . *(G,tK2) _ *(G,tK2) 

oo (G) = lim ———— = lim ~~. 

Foo(G) = im 1G, Ke) ~ o2. EQ) 
Clearly, 0 < f..(G) < 1. The following results were obtained in [25]. 
Theorem 20 ([25]). The set {7..(G) : G is a connected graph} is dense 
in [0,1]. 


Theorem 21 ([25]). For each n > 2, there is a corresponding positive 
constant a such that 


1 a 
— Ss Poo Ph es n < a 
not (Pn); Foo (Cn) : 
Also 
1 2 2 2 
ae ee Pees 
nr nm n nr 


Thus limy oo foo(Hn) = 0 for Hy = Pr,Cn, Kn or Ky. On the other 
hand, we can show that lim,_,., foo(K1,.,) = 1. This suggests the following 
question. 


Question 16 ([25]). Does limy +o foo(Gn) = 0 hold for every sequence of 
graphs (G,,) such that |V(Gn)| 4 co and A(G,,) is bounded as n + co? What 
sequences (G,) yield limp—.co Foo(Gn) = 1? 


4. Other Ramsey Problems 


4.1. Multiple Copies 

Assume F' + (mG, H) where mG denotes m vertex disjoint copies of G. How 
many copies of G must F' contain? 

Theorem 22 ([8]). If F > (mG, H), then tG C F where 


yo Keene +|V(A)| — B(A) — | 
|V(G)| 
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Question 17 ((8]). Jf F — (nG), then must F contain at least Lee 
copies of G? 


We have shown that if G and e > 0 are fixed then for all sufficiently large 
n, every graph F' satisfying F’ + (nG) contains at least r(nG)(1 —«¢)/|V(G)| 
copies of G. 


4.2. Ramsey Minimal Problems 


Let F,G,H be graphs without isolates. The graph F is (G, H)-minimal if 
F + (G,H) but F —e #A (G,H) for any edge e of F. The pair (G, H) 
is called Ramsey-finite or Ramsey-infinite according to whether the class of 
all (G, H)-minimal graphs is a finite or infinite set. It has been shown by 
NeSetiil and Rédl [52] that (G,H) is Ramsey-infinite when at least one of 
the following hold: 


(i) G and H are both 3-connected. 
(ii) x(G) and x(H) are both > 3. 
(iii) G and H are both forests, neither of which is a union of stars. 


Statement (iii) of the above result has been strengthened in the following 
way. 


Theorem 23. (i) [14] The pair (G, H) is Ramsey-infinite when both G and 
HT are forests, at least one having a non-star component. 

(ii) [11] If G and FH are star forests with no single edge star (Kz), then 
(G,H) is Ramsey-finite if and only if both G and H are single stars, 
each with an odd number of edges. 

(iii) [12] The pair (Kim UkKe, Ki, U lK2) is Ramsey-finite when both m 
and n are odd. 


Subsequently, Faudree characterized Ramsey-finite pairs of forests. 


Theorem 24 ([49]). If G and H are forests, then (G,H) is Ramsey-finite 
if and only af 


6= (Um) Unt Mm, > M2 >°:->mMs > 2, m>O0, 


i=l 


t 
H= (Ursin. )Untia meng 27 >my22,8>t>0, 


i=1 
and one of the following hold: 
(1) t=0 andn > 0, 


(2) s=t=1 and n,n, are odd, 
(8) 8 >2,t=1, m,n are odd, my > ny +me2—-1 andn > no = no(G, H). 
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It should be noted that the precise value of no(G, H) in (3) is not known. 
This is the only place where the Ramsey-finiteness question for a pair of star 
forests is not explicitly settled. 

The above results leave a considerable gap when either G or H has 
connectivity two or less. It is likely that (G,H) is Ramsey-infinite for all 
pairs not identified above as Ramsey-finite. 


Conjecture 8 ([13]). The pair (G, H) is Ramsey-infinite unless both G and 
FT are stars with an odd number of edges or at least one of G and H contains 
a single edge component. 


An interesting case of the above conjecture is when G is a cycle and H 
is two-connected. No technique is presently known for showing such a pair is 
Ramsey-infinite. 

Other results concerning Ramsey finiteness were obtained in [9] and [11]. 


Theorem 25. (i) [9] If G ts a matching and H is an arbitrary graph, then 
(G, H) is Ramsey-finite. 
(ii) [11] The pair (K14,,G) is Ramsey-infinite if k > 2 and G is a two- 
connected graph. Also (K1.2,H) is Ramsey-infinite if H is a bridgeless 
connected graph. 


Conjecture 9 ([10]). The pair (K1.2,G) is Ramsey-finite only if G is a 
matching. 


Conjecture 10 ((10]). Jf (G,H) is Ramsey-finite for each graph H, then G 
must be a matching. 


There is no known case where adding independent edges to G (or H) 
changes a Ramsey-finite pair (G, H) to a Ramsey-infinite pair. 


Conjecture 11 ([13]). Jf (G,H) is Ramsey-finite then (GU K2,H) is 
Ramsey-finite. 


5. Extremal Problems 


5.1. Edge Density and Triangles 


The classical result of Turan implies that any graph with n vertices and at 
least |n?/4|+1 edges contains a triangle. A natural generalization is obtained 
by fixing a between 0 and 1 and asking for the minimum number f.(n) so 
that a graph of order n must contain a triangle if each set of |a@n| vertices 
induces a subgraph with at least f.(n) edges. 


Example 4 ([45]). Let M; = Keo, Mz = Cs, and let M3, be the graph 
obtained from Cg by adding all of the long chords. From M, obtain a graph 
H;, of order n = k|V(M;)| by replacing each vertex of M; by an independent 
set of k vertices, and making each pair of vertices of H; in different expanded 
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sets adjacent if and only if the replaced vertices are adjacent in M;. Each 
graph H; is extremal triangle-free in that the addition of any edge creates a 
triangle. 


Note that the minimum number of edges induced by a set of n/2 vertices 
in Hz is n?/50. This leads to the following conjecture of Erdés. 


Conjecture 12 ([45]). Jf each set of |n/2| vertices in a graph G of order n 
spans more than n?7/50 edges, then G contains a triangle. 


An analysis of the graphs H,, Hz, and H3 leads to the following more 
general conjecture. 


Conjecture 13 ([45]). Suppose 17/30 < a < 1 and B > (2a —1)/4 or 
43/120 < a < 17/30 and 8 > (5a — 2)/25. For all sufficiently large n, a 
graph of order n in which each set of |an| vertices spans at least Bn? edges 
must contain a triangle. 


In [45] this conjecture is proved for .648...< a < 1; otherwise it is open. 
The case of a = 1/2 is of particular interest. 

One direction would be to extend these results from triangles to arbitrary 
complete graphs. For a = 1/2, the following conjecture has been made by 
Chung and Graham. 


Conjecture 14 ([22]). Let b:(n) denote the minimum number of edges 
induced by any set of n/2 vertices in the Turdn graph on n vertices for K;. 
If each set of |n/2| vertices in a graph G of order n spans more than b;(n) 
edges, then G contains a Ky. 


5.2. Minimal Degree 


The wheel W,, = K, + Cy_1 has n vertices, 2n — 2 edges, and minimum 
degree 3, but any proper subgraph of W,, has a vertex of degree less than 3. 
On the other hand, it is easy to prove that any graph with n vertices and 
2n — 1 edges has a proper subgraph of minimum degree at least 3 [27]. 


Theorem 26 ([43]). If G is a graph with n vertices and 2n—1 edges, then 
G contains a subgraph with less than n — ,/n/48 vertices that has minimum 
degree at least 3. 


A stronger result has been conjectured. 


Conjecture 15 ([43]). There is an ¢ > 0 such that every graph with n 
vertices and 2n — 1 edges has a subgraph with (1 —«¢)n or fewer vertices 
that has minimum degree at least 3. 


A more general version of Theorem 26 was proved in [43] and indepen- 
dently by other authors. 
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Theorem 27 ([43]). For k > 2, any graph with n vertices and (k — 1)(n— 
k+2)+ ng + 1 edges contains a subgraph with at most n — |\/n/(6k) | 
vertices that has minimum degree at least k. 


The generalized wheel W(k —2,n) = Ky—2+Cy—x+2 Shows that the number 
of edges in the last result cannot be reduced. Conjecture 15 has the following 
generalization for subgraphs of minimum degree at least k > 2. 


Conjecture 16 ([43]). For each k > 2, there is a corresponding ¢ > 0 such 
that every graph with n vertices and (k — 1)(n —k+2)+ Ce) +1 edges 
has a subgraph with (1 — e€)n or fewer vertices that has minimum degree at 
least k. 


5.3. Odd Cycles in Graphs of Given Minimum Degree 


Example 5 ((29]). Let k > 3 be an odd integer and let H be the two- 
connected nonbipartite graph obtained from Cy42 by replacing each of its 
k +2 vertices by an independent set of size s. Then H is a graph of order 
n = (k+2)s that contains no odd cycles of length < k and contains all possible 
odd cycles of larger length. Also H is regular of degree 2s. 


Theorem 28 ((29]). Let k > 3 be an odd positive integer. There exists 
f(k) such that ifn > f(k) and G is a two-connected nonbipartite graph 
of order n with minimum degree > 2n/(k +2), then either G contains a Cy 
orG= H. 


If G is a two-connected nonbipartite graph of appropriately large order n 
and 6(G) > 2n/(k+ 2), what range of odd cycles must G contain? 


Example 6 ([29]). Seta = [2n/(k+2)] andb = n—a. The graph G obtained 
by adding an edge to the smaller part of Ka,» contains Ca.41 for1<t<a-l, 
but no odd cycle of length greater than 2a — 1. 


Thus the best possible result would show that, aside from H, any two- 
connected nonbipartite graph with large order n and minimum degree at 
least 2n/(k + 2) contain odd cycles of all lengths in an a range up to about 
4n/(k +2). The next result shows that such a result is true (asymptotically) 
if the factor 4 in the last statement is replaced by 8/3 — o(n). 


Theorem 29 ([29]). Given 0 < c < § and0 <e <1, there exist hi(c,€) 
and hg(c,€) so that for alln > ha(c,€) every two-connected nonbipartite graph 
G of order n and minimum degree 6(G) > cn contains Cot41 for hi(c,e) < 


2t+1< 41 —e)en/3. 
Is Example 6 really the truth? 


Question 18 ([29]). If n is appropriately large and G #4 H is a two- 
connected nonbipartite graph of order n satisfying 6(G) > 2n/(k +2), must 
G contain Cor41 for k < 2t+1 < 2/2n/(k+2)] —1? 
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5.4. Vertices and Edges on Odd Cycles 


Every graph with n > 3 vertices and at least |n?/4| + 1 edges contains a 
triangle, and in fact a cycle of length 2k + 1 for every k < |(n+1)/4]. For 
such a graph, how many vertices (edges) must lie on triangles and, more 
generally, on cycles of length 2k + 1? 


Theorem 30 ([32]). Let G be a graph with n vertices and at least |n?/4| +1 
edges. (a) At least |n/2| +2 vertices of G are on triangles, and this result 
is sharp. (b) At least 2|n/2| +1 edges of G are on triangles, and this result 
is sharp. (c) If k > 2 and n > max{3k(3k + 1), 216(3k — 2)}, then at least 
2(n — k)/3 vertices of G are on cycles of length 2k + 1, and this result is 
asymptotically best possible. (d) If k > 2 is fixed, then at least 11n?/144—O(n) 
edges of G are on cycles of length 2k +1. 


The example used to show that the result in (c) is asymptotically best 
possible suggests what the sharp result should be in place of (d). 


Example 7. Consider the graph K,UT2(n — s) where s = [2n/3] +1 and 
T2(m) denotes the complete bipartite graph with parts |m/2| and [m/2] (the 
Turan graph). This graph has [2n/3]+ 1 vertices on cycles of length 2k +1 
and ~ 2n?/9 edges on cycles of length 2k + 1. 


Conjecture 17. Ifk > 2 is fixed, then as n — oo every graph with n vertices 
and |n?/4| +1 or more edges has at least 2n?/9 — O(n) edges on cycles of 
length 2k+1? 


5.5. Extremal Paths 


Let m,n and k be fixed positive integers with m > n > k. We wish to find 
the minimum value of | such that each m vertex graph with at least | vertices 
of degree > n contains a P,,,. A plausible minimum value for / is suggested 
by the following construction. 


Example 8. Let m = t(n+1)+r where0 <r<n+1 and assume k < 
2n+1. Let s = |(k—1)/2|. The graph consisting of t vertex disjoint copies 
of H = Knii_s + Ks and r isolated vertices contains ts vertices of degree n 
and no Pri. When k is even andr+s>n, the number of vertices of degree 
> n in this graph can be increased by 1 to ts+1 without forcing the graph 
to contain a P41. Simply take one of the vertices of degree s and make it 
adjacent to each of the r isolated vertices. 


Theorem 31 ([46]). Let m and n be positive integers such thatn+1<m< 
2n+1. If G is of order m and contains at least k vertices of degree > n, then 
G contains (i) a Pop_s ifk <n/2+8, (tt) a Pass if (n+1)/24+3<k<n, 
and (wt) a Py ifn+1<k. 


Conjecture 18 ([46]). Let m,n and k be positive integers with m >n>k 
and set 6 = 2 if k is even and 6 =1 if k is odd. If G is a graph of order m 
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and at least | = |(k — 1)/2||m/(m+1)| + 6 vertices of degree > n, then G 
contains a Pri. 


5.6. Degree Sequence and Independence in K,4-Free Graphs 


For a graph G with 6(G) > 1 let f(G) denote the largest number of times 
any entry in the degree sequence of G is repeated. If G is K,-free graph and 
f(G) < k, what is the smallest possible value of the independence number 
B(G)? 

Theorem 32 ((31)). 


(i) If G is a K3-free graph of order n and f(G) < k, then B(G) > n/k. 
(ii) [fr >5 andk > 2, then there exists a sequence (G,) of K,.-free graphs 
such that |V(G,,)| =n and f(G,) <k for each n and B(G,,) = o(n) as 
n> ow. 
) If G is a K4-free graph of order n and f(G) = 2, then B(G) > n/12. 
(iv) There exists no sequence (G,) of K4-free graphs such that |V(G,)| =n 
and f(G,) = 3 for each n and B(G;,) = o(n) as n > oo. 


Bollobaés has shown that there exists a sequence (G,) of K4-free graphs 
such that |V(G,,)| = n and f(G,) > 5 for each n and 6(G,,) = o(n) as 
n — oo [4]. The following questions are open. 


Question 19 ([31]). Does there exist a sequence (G,,) of K4-free graphs such 
that |G,,)| =n and f(G,) = 4 for each n and B(G;,) = o(n) as n > co? 


Question 20 ([31]). If G is a K3-free graph of order n with f(G) < k, is 
the lower bound B(G) > n/k best possible? 


6. Other Problems 


6.1. Monochromatic Coverings 


If A and B are sets of vertices in a given graph G, we say that A dominates 
or covers B if for every y € B\ A there is an x € A such that zy € E(G). 
The following result was conjectured by Erdés and Hajnal. 


Theorem 33 ([28]). For any fixed t, in every two-coloring of the edges of 
Ky, there exists a set of t or fewer vertices that monochromatically dominates 
at least n(1 — 2~*) vertices. This result is essentially sharp. 


In [26] we studied monochromatic domination when more than two colors 
are involved. For three colors, we have the following result. 


Theorem 34 ([26]). Three-color the edges of K,. Then in at least one color, 
there is a set consisting of 22 or fewer vertices that dominates a set of at least 
2n/3 vertices. 
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The fact that no small set will, in general, monochromatically dominate 
more than two-thirds of the vertices comes from the following example due 
to Kierstead. 


Example 9 (Kierstead). Three-color the edges of K3m as follows. Form 
the partition V(K3m) = (A,,A2,A3) and for i = 1,2,3 (mod 3) color the 
internal edges of A; and all the edges between A; and Aj,1 with colori. Then 
each monochromatic subgraph is isomorphic to Km +K», and there is no set 
that monochromatically dominates more than 2m vertices. 


It is possible that Theorem 34 holds with “22” replaced by “3.” 


Question 21. Is it true that in every three-coloring of the edges of K,, there 
is a set of three vertices that monochomatically dominates two-thirds or more 
of all vertices? 


6.2. Spectra 


A nonnegative integer q belongs to the k-spectrum of a graph G if there 
is some set of & vertices in G spanning q edges. Denote the k-spectrum of 
G by s,(G). Then s,(G) C {0,1,--- cape but not every such subset is an 
example of a k-spectrum. Let nz denote the number of distinct subsets of 
{O,1---, oy that are realizable as k-spectra. The k-spectra of all large trees 
were characterized in [47]. 


Theorem 35 ([47]). Ifn > max{2k—1,3k—5}, then for each tree T of order 
n there exist corresponding integers r and s withO < [r/2]} <s<r<k-1 
such that s;(Tn) = [0,r] U [s,k — 1]. 


The following bounds have been obtained for number of realizable 
k-spectra. 


Theorem 36 ((47]). For any integer k > 2, the number of distict realizable 


k-spectra satisfies i (aye <n < 9g(2)+1. 


For k = 2,3 and 4, the realizable k-spectra have been determined [50]. 


Question 22. For k > 5, what is nz, the number of distinct realizable 
k-spectra? 


A similar problem was asked by Erdés and Faudree concerning the cycle 
spectrum of a graph, namely the set of distinct cycle lengths of a graph. Let 
Cn, be the number of distinct cycle spectra for graphs of order n. The following 
family of graphs gives a lower bound for the cy. 


Example 10. Let v be a verter in C,, and let S C {2,3,...,|n/2|}. Add to 
C,, all chords extending from v whose lengths belong to S. Distinct sets S will 
give distinct cycle spectra, since the cycles of length at least [n/2+1] will be 
different. 
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Using this example, we see that 2”/2 —1 < cn, <2" —2. 


Question 23. What is the number of distinct cycle spectra of a graph of 
order n? 


6.3. Clique Coverings and Partitions 


A clique covering of a graph G is a set of cliques that together contain all of 
the edges of G; if each edge is contained in precisely one of these cliques, we 
have a clique partition. The clique covering number cc(G) of G is the smallest 
cardinality of any clique covering, and the clique partition number cp(G) is 
the smallest cardinality of any clique partition. The relationship between 
cc(G) and cp(G) was investigated in [30], where the following example was 
described. 


Example 11. For any integer n divisible by 8, let Gn = Knjg+4Kn/g. Then, 
clearly cc(Gn) = 4, and it was shown in [30] that cp(Gn) = n?/16 + 3n/4. 
Thus, 
cp(Gn) 
cc(Gn) 
Question 24. What is the largest C' for which there is a sequence of graphs 
(Gy) such that |V(G,)| =n and 


cp(Gn) 
cc(Gn) 


In [19] the authors exhibit a sequence of graphs (G,) such that 
|V(Gn)| =n 


> n?/64. 


> COn?? 


n? n3/2 n 


Paul Erdés has asked whether the n°/? term is really necessary. 


cp(G,) — cce(Gn) = 


Question 25. Is there a sequence of graphs (G,,) such that |V(G,)| = n and 
cp(Gn) — cce(Gn) = n?/4+ O(n), (n + 00)? 
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1. Introduction and Main Results 


All (undirected) graphs and digraphs considered are assumed to be finite (if 
not otherwise stated) and loopless. Multiple edges (arcs) are permitted. For 
a graph G, let V(G), E(G), and y(G) denote the vertex set, the edge set, 
and the chromatic number of G, respectively. If X C V(G) and F C E(G), 
then G— X — F denotes the subgraph H of G satisfying V(H) = V(G) — X 
and E(H) = {zy | cy € E(G) -—F and z,y ¢ X}. 

A system of non-empty subgraphs {Gj,...,Gm}, m > 1, in a graph 
G is called a decomposition of G if and only if E(G) = Uz2, E(G;) and 
E(G;)N E(G;) =0,1<i<j<m. For an integer k > 2, let G;, denote the 
family of all graphs G which have a decomposition into a Hamiltonian cycle 
and m > 0 pairwise vertex disjoint complete subgraphs each on k vertices. 
Note that every vertex of a graph G € G,, k > 2, has degree either k+1 or 2. 
In particular, G, contains every cycle. 

The authors proved the following result providing an affirmative solution 
to a colouring problem of P. Erdés, which became known as the “cycle-plus- 
triangles problem” (see e.g. [5] or [7]). 


Theorem 1 ([9]). If G € Gs then x(G) <3. 


Actually, Theorem 1 was proved only for 4-regular graphs. But every non- 
regular graph G’ € G3 can be obtained from a 4-regular graph G € G3 by 
subdividing some edges of the Hamiltonian cycle of G, and hence every legal 
3-colouring of G can be extended to a legal 3-colouring of G’. Recently, H. 
Sachs [12] found a purely elementary proof of Theorem 1. 


Theorem 2. Let k > 4 be an integer and let G be a graph. Assume that G 
has a decomposition D into a Hamiltonian cycle and m > 0 pairwise vertex 
disjoint complete subgraphs each on at most k vertices. Then x(G) < k. 
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Proof (by induction on k). Let X = \J)V(K) where the union is taken over 
all complete subgraphs K € D of size < k — 2. Then every vertex of X has 
degree < k — 1 in G implying that every legal k-colouring of G — X can be 
extended to a legal k-colouring of G. Therefore, we need only to show that 
y(G — X) < k. Clearly, G— X is a subgraph of some graph G’ which has 
a decomposition D’ into a Hamiltonian cycle and a (possibly empty) set of 
pairwise vertex disjoint complete subgraphs on k — 1 or k vertices each. We 
claim that .(G’) < k. Let D, be the set of all complete subgraphs from D’ 
with exactly k vertices. If Dj; = @ then from the induction hypothesis or, 
in case k = 4, from Theorem 1 we conclude that x(G’) < k-—1 < k. If 
D, #0 then we argue as follows. First, choose from every complete subgraph 
kK € Dy, an arbitrary vertex y = y(K) and let Y be the set of all these 
vertices. Then G’ — Y is a subgraph of some graph in G,_1 and therefore, 
the induction hypothesis or, in case k = 4, Theorem 1 implies that G’ — Y 
has a legal colouring with k — 1 colours, say 1,2,...,& — 1. Obviously, each 
complete subgraph K —y(K) of G’—Y, K € Dj, contains precisely one vertex 
coloured 1 (note that kK — y(i<) has k — 1 vertices). Now, let Z be the set 
of all such vertices. Then Z is an independent set in G’ — Y, and hence also 
in G’. By the same argument as before we conclude that G’ — Z has a legal 
(k — 1)-colouring implying that G’ has a legal k-colouring, i.e. x(G’) < k. 
This proves our claim, and hence the desired result. 


Theorem 2 is not true for k = 3. A counterexample is the complete graph 
on 4 vertices, a further one due to H. Sachs is shown in Fig. 1. 


Fig. 1 


As a consequence of Theorems 1 and 2 we obtain the following proposition. 


Corollary 1. If GEG, for some k < 3, then x(G) < k. 
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The proof of Theorem 1 given in [9] uses a recent result of N. Alon and 
M. Tarsi [2] which provides a colouring criterion for a graph G based on 
orientations of G. On the one hand, this yields a somewhat stronger result 
than Theorem 1, namely that every graph G € G3 has list chromatic number 
at most 3. On the other hand, this implies that the proof of Theorem 1 is not 
constructive in the sense that it does not yield any algorithm which, given G € 
G3, allows a legal 3-colouring of G to be constructed. For a graph G € Gx, k > 
4, a legal k-colouring of G can be easily obtained using the induction step of 
Theorem 2 and the following colouring procedure for regular graphs from G4. 

Let G € G4 be any 5-regular graph, and let D be a decomposition of 
G into a Hamiltonian cycle C and m > 1 pairwise vertex disjoint complete 
subgraphs K',...,K’ each on 4 vertices. For E’ C E(G), let G(E’) denote 
the spanning subgraph of G with vertex set V(G) and edge set E’. By a linear 
factor of G we mean a set L C E(G) such that G(L) is 1-regular. 

To obtain a legal 4-colouring of G, we first choose an arbitrary linear 
factor L of C (note that the cycle C has length 0 mod 4) and, for each 
i € {1,2,...,m}, an arbitrary linear factor Li of K*. Obviously, the subgraph 
G(F), where F = LUL}U...UL", is the disjoint union of even cycles, and 
hence we easily find a legal 2-colouring c; of G(F’). Now it is easy to check 
that we can choose a linear factor L5 of K'— Li (i =1,...,m) such that cy 
remains a legal 2-colouring of G(F,), where F, = FU L}U...U L3". Then 
F, = E(G) — F, consists of the linear factor L’ = E(C) — L and the linear 
factors Lt = E(K*)— Li — Li (i =1,...,m). Therefore, G(F) is the disjoint 
union of even cycles and has a legal 2-colouring cg. Eventually, because of 
G = G(F, U Fy), the mapping c defined by c(x) = (ci(2), ca(x)), cx € V(G), 
is a legal 4-colouring of G. 

Noga Alon [1] proposed a more general situation. Let H be a graph on 
n vertices. If k divides n, then H is said to be strongly k-colourable if, for 
any partition of V(#) into pairwise disjoint sets V; each having cardinality 
k, there is a legal k-colouring of H in which each colour class intersects each 
V, by exactly one vertex. Notice that H is strongly k-colourable if and only 
if the chromatic number of any graph G' obtained from H by adding to it a 
union of vertex disjoint complete subgraphs (on the set V(H)) each having 
k vertices is k. If k does not divide n, then H is said to be strongly k- 
colourable if the graph obtained from H by adding to it k[n/k] —n isolated 
vertices is strongly k-colourable. Of course, the strong chromatic number of 
a graph H, denoted by sy(H), is the minimum k such that H is strongly 
k-colourable. In this terminology, Theorem 1 says that, for every cycle C3, 
on 3m vertices, sx(C3m) < 3. Moreover, Theorem 2 implies that, for every 
cycle C, sx(C) < 4. For cycles on 4m vertices this statement was established 
by several researchers including F. de la Vega, M. Fellows and N. Alon (see 
[7]). The 4-chromatic graph depicted in Fig. 1 shows that sy(C7) = 4. 

Noga Alon [1] investigated the function sx(d) = max(sy(H)), where H 
ranges over all graphs with maximum degree at most d. On the one hand, 
using similar arguments as above he gives a short proof that sy(H) < 2x'(#), 
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where y’(H) denotes the chromatic index of H. Because of Vizing’s Theorem, 
this yields sy(d) < 2¢+!. On the other hand, using probabilistic arguments, 
he proved that there is a (very large) constant c such that sy(d) < cd for 
every d. This leaves the following problem. What is the minimum value k such 
that every graph with maximum degree d is strongly k-colourable? Even for 
d = 2, the answer is not known. Clearly, for d = 1 we have sy(1) = 2. To 
obtain a lower bound for sy(d), we construct a graph Hg for every d > 1 
in the following way. Let X,, X2, X3,X4 be disjoint sets on r vertices each, 
r>1. Join X; to Xi41 (¢ = 1,2,3) and X4 to X, by all possible edges. The 
resulting graph is H2,. The graph H2,_, is obtained from H2, by removing 
two adjacent vertices. Then the graph Hg, d > 1, is regular of degree d and 
it is easy to check that Hg is not strongly k-colourable for k < 2d — 1, ice. 
sx(H4) > 2d. This yields sy(d) > 2d for every d > 1. 

In [5] P. Erdés asked whether the statement of Theorem 1 remains true if 
the Hamiltonian cycle of the decomposition is replaced by a 2-regular graph 
not containing a cycle of length 4. The graph G depicted in Fig. 2 provides 
a negative answer. This graph belongs to an infinite family of 4-regular 
4-chromatic (in fact, 4-critical) graphs due to Gallai [6] all of which (except 
one) are counterexamples with respect to Erdés’s question. 


Fig. 2 


We define a (not necessarily connected) graph to be Eulerian if and only 
if each vertex has even degree. Analogously, a digraph is said to be Eulerian 
if and only if each vertex has equal out-degree and in-degree. Let G be an 
Eulerian graph. Clearly, G has an orientation D (this means, that D is a 
digraph whose underlying graph is G) which is Eulerian. Such an orientation 
is briefly called an Eulerian orientation of G. We denote the number of all 
Eulerian orientations of G by e(G). If D is an arbitrary Eulerian orientation 
of the Eulerian graph G, then (see [8]) e(G) is equal to the number of all 
spanning Eulerian subdigraphs of D. Therefore, Theorem 2.1 in [9] implies 
the following. 


Theorem 3 ([9]). If G € G3 then e(G) =2 mod 4. 
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Theorem 3 may be considered as the main result of [9]. To deduce 
Theorem 1 from Theorem 3, the colouring criterion of Alon and Tarsi (see 
[2] or [9]) is needed. As an immediate consequence of Theorem 3 we obtain 
the following result. This was first noted by Alon, Gutner and Tarsi. 


Corollary 2. Let G be a graph, which has a decomposition into a Hamil- 
tonian cycle C and m > 1 pairwise vertex disjoint triangles T,,...,Tm, 
i.e. G € G3. Then G has an Eulerian orientation D such that no triangle 
T;(i =1,...,m) is cyclically oriented in D. Moreover, if V;(V2) is the set of 
all those vertices from G which are sinks (sources) of some triangle T; with 
respect to D, then both V; and V2 are independent sets in G each of size m. 


Proof. Let O denote the set of all Eulerian orientations of G, and let O; 
(i = 1,...,m) denote the set of all Eulerian orientations of G in which the 
triangle T; is cyclically oriented. To prove that a desired orientation D of G 
exists, suppose the contrary. Then O = Ji", Oi, and hence 


GB) =1OL= Dyers Deol. 


where o(1) = |()je7 Oi. It is easy to check that, for I 4 0, we have o(I) = 
2!le(G’), where G’ = G—U,_; E(T;). Since G’ € G3, from Theorem 3 it then 
follows that o(1) = 0 mod 4, and hence e(G) = 0 mod 4, a contradiction. 
Therefore, the desired orientation D of G exists. Obviously, the sets Vj and 
V, satisfying the hypothesis of Corollary 2 are both independent sets of size 
m in G. This proves Corollary 2. 


In particular, from Theorem 3 we obtain immediately that every 4-regular 
graph G © Gs with 3m vertices has independence number m, as conjectured 
by D. Z. Du and D. F. Hsu in 1986 (see [8, 7]). 

In connection with the Cycle Double Cover Conjecture R. Goddyn 
(private communication) suspects that there is no snark H with a chordless 
dominating cycle C (this means, that V(H) — V(C) is an independent set in 
Hf). The first author and M. Tarsi observed that the following result can be 
deduced from Corollary 2. 


Corollary 3. Assume that H is a 3-regular graph which has a dominating 
cycle C. Then there is a matching M of H — E(C) such that H — M is 
2-regular or isomorphic to a subdivision of a 3-regular bipartite graph. 


Proof. Put X = V(H)—V/(C), and let M; be the set of all chords of C' in H. 
From the cycle C’ we construct a new graph in the following way. For every 
vertex x € X, join the three neighbours of x on C' by three new edges forming 
a triangle T;,. Clearly, the resulting graph G belongs to G3. From Corollary 2 
it then follows that there is an Eulerian orientation D of G such that no 
triangle T,, is cyclically oriented. Now, let V, (V2) be the set of all those 
vertices from G which are sinks (sources) of some triangle T, with respect to 
D. Then, for every vertex x € X, there is exactly one edge in H joining x to 
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some vertex of V(C) — Vi — V2. Denote the set of all such edges by Mz, and 
put M = M,U Mp. Clearly, M is a matching of H — E(C). Since both Vy 
and V2 are independent sets in G (see Corollary 2), it is easy to check that 
H — M is 2-regular (in case of X = @) or isomorphic to a subdivision of a 
3-regular bipartite graph (in case of X 4 @). This proves Corollary 3. 


One can easily show that if a 3-regular graph H with a dominating cycle 
contains two disjoint matchings as described in Corollary 3, then H has a 
nowhere-zero 5-flow. 


Remark (added in 2013). Moreover, if H as above has two such disjoint 
matchings, then H also has a 5-cycle double cover containing the dominating 
cycle, [10]. It was therefore conjectured by the first author of this article that 
if H is cyclically 4-edge-connected and has a dominating cycle C, then it 
has two such disjoint matchings with respect to C. This was disproved by 
his PhD student Arthur Hoffmann-Ostenhof, [11]. However, together with the 
Dominating Cycle Conjecture (DCC) it would suffice to prove the existence of 
two such disjoint matchings with respect to some dominating cycle, to ensure 
the validity of the cycle double cover conjecture (CDCC) and the nowhere- 
zero 5-flow conjecture (NZ5FC). In fact, to prove the CDCC it would suffice 
to prove the DCC and the existence of two such disjoint matchings with 
respect to a stable dominating cycle; however, this would not be enough to 
prove the NZ5FC. 

Finally, let us mention the following infinite version of Corollary 1. A set 
of cardinality k is briefly called a k-set. 


Theorem 4. Let k > 3 be an integer. For any partition D of the integers 
Z into k-sets, there is another partition {X,,...,Xx} of Z such that X; 
(i = 1,...,k) contains a member of each k-set from D but no consecutive 
pair of integers. 


Proof. Define G to be the infinite graph with vertex set Z, where ry € E(G) if 
and only if |a—y| = 1 or z,y € K € D. Consider an arbitrary finite subgraph 
HT of G. Then it is easy to check that H is a subgraph of some graph from 
G;, and hence, using Corollary 1, we conclude that H is k-colourable. From 
a well-known result of Erdés and de Bruijn [4] we then obtain that G is 
itself k-colourable. This immediately implies that y(G) = k. Obviously, any 
legal k-colouring of G provides a partition {X1, X2,...,X,} of Z having the 
desired properties. 
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Summary. A p-representation of the complete graph K,,,, is a collection of sets 
{51,S2,...,S2n} such that |S; S;| > p if and only ifi <n < j. Let Vp(Knn) 
be the smallest cardinality of US;. Using the Frankl-Rédl theorem about almost 
perfect matchings in uncrowded hypergraphs we prove the following conjecture of 
Chung and West. For fixed p while n — 00 we have Jp(Kn,n) = (1 + 0(1))n?/p. 
Several problems remain open. 


1. The p-Intersection Number of K(n, n) 


One of the important topics of graph theory is to represent graphs, or an 
interesting class of graphs, using other simple structures. One approach is to 
represent the vertices by sets so that vertices are adjacent if and only if the 
corresponding sets intersect (line graphs). More generally, the p-intersection 
number of a graph is the minimum ¢ such that each vertex can be assigned a 
subset of {1,...,¢} in such a way that vertices are adjacent if and only if the 
corresponding sets have at least p common elements. Such a system is called 
a p-representation (or p-intersection representation) of the graph G, and the 
minimum ¢ is denoted by ¥,,(G). 


Problem 1. Given n and p determine max¥,(G) among the n-verter 
graphs. 


For any graph of v vertices Erdés, Goodman, and Pésa [10] showed that 
01 < |v?/4|, and here equality holds for K\/2],[v/2]- This settles the case 
p =1in Problem 1. Myung S. Chung and D. B. West [5] conjectured that the 
complete bipartite graph also maximizes J,. Their lower bound for p > 1 is 


Vy(Knn) = (n? —n)/p+2n. (1) 


In this note we determine J, for these and a few more graphs. Especially, 
we show that in (1) equality holds asymptotically (p is fixed n + oo) with 
a smaller order error term. One of our main construction (Theorem 1) along 
with the lower bound (7) was independently discovered by Eaton, Gould and 
Rédl [6, 9]. They also considered 2-representations of bounded degree trees. 
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More interestingly, considering the the unbalanced complete bipartite graph 
Eaton [7] disproved the Chung-West conjecture by showing that 


L6p? — 12p +1 n? 
9,(Kap) > (1 +o(1 
tee o ») a ( = a )) 16p? a 16p +4 P 


(2) 


Her method was further developed in [8] when J,(K,,,) was exactly deter- 
mined by infinitely many values using certain designs and Steiner systems. 
The complete k-partite graph, Ki ) n, has kn vertices, k disjoint indepen- 
dent sets of sizes n and all the (5) n? edges between different classes. K(n x k) 
denotes a graph with vertex set V!U...UV*, V6 = {uf,...,v&} and uf is 
joined to v7” if and only if i 4 j and £4 m. So K(n x 2) is obtained from 
Kn by deleting a one factor. 


Theorem 1. For fixed p and k, the p-intersection number of the complete 
Ym is (1+ 0(1))n?/p. 


k-partite graph as 

Note that the asymptotic is independent from the fixed value of k. In 
Sect. 4 we will give a partial proof for Theorem 1 using classical design 
theory and obtain a better error term. A Hadamard matrix of order n is a 
square matrix M with +1 entries such that MM = nI,. It is conjectured 
that it exists for all n = 0 (mod 4). The smallest undecided case is larger 
than 184. An S)(v,1,t) block design is a l-uniform (multi)hypergraph on v 
vertices such that each t-subset is contained in exactly \ hyperedges (blocks). 
Block designs with A = 1 are called Steiner systems. All notions we use about 
designs can be found, e.g., in Hall’s book [15]. Wilson [22] proved that for 
any ! there exists a bound vo(l) such that for all v > vp there exists a Steiner 
system $(v, 1,2) if (%)/(9) and (uv — 1)/(J— 1) are integers. 


ies 


Theorem 2. (a) If there exists a Hadamard matrix of size 4p, and a 
Steiner system S(n, 2p, 2), then Jp(K(n x 2)) = (n? — n)/p. 

(b) If p = q* where q is a prime power, d a positive integer, k < q, 
and there exists a Steiner system S(n,q¢*1,2), then B,(K(n x k)) = 
(n? —n)/p. 

Corollary 1. For all p in Theorem 2, and n > no(p) 

n? [p< Yp(Knn) S Sp(K(n x 2) + pn < (n2/p) + 4pm 


This covers all cases p < 46. A construction from the finite projective 
space is given in Sect. 3. In Sect. 5 we list a few open problems. 


2. A Random Construction 


A hypergraph H with edge set €(H) and vertex set V(H) is called r-uniform 
(or an r-graph) if |E| = r holds for every edge E € E(H). The degree, deg,, (x), 
of the vertex x € V is the number of edges containing it. The degree of a pair, 
degy,(x, y), is the number of edges containing both vertices 7 and y. The dual, 
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H*, of H is the hypergraph obtained by reversing the roles of vertices and 
edges keeping the incidences, i.e., V(H*) = E(H). A matching M C E(H) 
is a set of mutually disjoint edges, v(H) denotes the largest cardinality of a 
matching in H. 

We are going to use a theorem of Frankl and Rédl [11]. The following 
slightly stronger form is due to Pippenger and Spencer [20]: For all integers 
r > 2 and real < > 0 there exists a 6 > 0 so that: If the r-uniform hypergraph 
H. on z vertices has the following two properties (i) (1 — 4)d < degy(x) < 
(1+ 4)d holds for all vertices, (ii) deg,,(z,y) < 6d for all distinct x and y, 
then there is a matching in H almost as large as possible, more precisely 


V(H) 2 (1 —€)(z/r). (3) 


Far reaching generalizations of (3) have been proved by Kahn [18]. 

Suppose G is a graph and F = {F\,...,F;} is a family of subsets of 
the vertex set V(G), repetition allowed. Such a system F is called a p-edge 
clique cover if every edge of G is contained in at least p members of F and 
the non-edge pairs are covered by at most p— 1 of the F;’s. A p-edge clique 
cover is the dual of a p-representation (and vice versa), so the smallest t for 
which there is a p-edge clique cover is ¥,(G). This was the way Kim, McKee, 
McMorris and Roberts [19] first defined and investigated J,,(G). 


Proof of Theorem 1. To construct a p-edge clique cover of the complete 
k-partite graph with n-element classes V',...,V* consider the following 
multigraph M. Every edge contained in a class V* has multiplicity p — 1, 
and all edges joining distinct classes (crossing edges) have multiplicity p. The 
total number of edges is 


Jem] = 98(3) +0(S)n = 0+ o(1)) («(2)+()") @ 


vertex set E(M) as follows. The hyperedges of H are those r-subsets of €(M) 
which form a complete k-partite subgraph with p vertices in each V*. The 
number of such subgraphs is 


|E(H)| = (") @- o(S)e* 


i.e., the number of hyperedges of H containing e is exactly 


k-1 
n—2\ (n P k 
deg,,(e) = ( )( ) p-1l k(B)—1n(2) 2, 5 
wle)=("3\(") @-0 (5) 
For any crossing edge f € €(M) connecting two distinct classes we have 


degy(f) = (3) ; (") Ep Oe, (6) 


p-l 


Perret 
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The ratio of the right hand sides of (5) and (6) is n/(n—1), so the hypergraph 
H is nearly regular, it satisfies the first condition in the Frankl-Rodl theorem 
for any 6 > 0 if n is sufficiently large. For two distinct edges, e1,e2 € E(M), 
obviously degy,(e1, e2) = O(n*?~3), so condition (ii) is fulfilled, too. Apply 
(3) to H. We get a system F = {Fj,..., FL} of kp-element subsets of UV? 
such that every pair e contained in a class V* is covered at most p— 1 times, 
every pair f joining two distinct classes is covered at most p times. Moreover, 
v = (1 — 0(1))n?/p, by (4). It follows that almost all edges of iG? are 
covered exactly p times, so the system F can be extended to a p-edge clique 
cover by adding sufficiently many (but only o(n”)) edges. 


3. Exact Results from Finite Projective Spaces 


Proposition 1. For all p> 1, 0)(K(n x 2)) > (n? —n)/p. 


Proof. Let V+ and V? be the two parts of the vertex set of the graph, |V!| = 
|V2| =n, let {A; UB; : 1 <i <t} be a p-edge clique cover, their average size 
on one side is ¢ := )7,(|A;| +|Bi|)/(2t). Using the inequalities 


(1) 04 (9") $ @—D(), and 
(2) X, ('2'!) < (p—1)(8), and the fact that 
(3) all the n? — n crossing edges are covered at least p times we have 


p(n? —n) < > |Ail| Bil < = (ee n @) 


+£t < (p—1)(n? —n) 4+ &. 


This gives (n? — n) < ét. On the other hand, (1) and (2) give 2¢(5) < 
2(p — 1)(5). Hence ¢ < p and t > (n? — n)/p follows. 


Replacing (3) by pn? < >, |A;||B;| the above proof gives 
DalKona) 2 (n+p— 1)?/p, (7) 


which is better than (1) for n < (p—1)?. 

Consider a K(n x k) with classes V1,...,V*,V* = {vf,...,u£}. We call 
the p-edge clique cover F = {F\,..., F;} perfect if the sets {F; NV’: 1 < 
i < n} form an S,_1(n,p,2) design for all @. It follows, that |F;| = kp for 
all F;, every edge of K(n x k) is contained in exactly p sets, every pair from 
V° is covered p — 1 times, every pair of the form {v£, v™} is uncovered, and 
t=(n?—n)/p. 

Proposition 2. If p = q¢, where q is a prime power, d a positive integer 


and k < q, then there exists a perfect p-edge clique cover of K(q?*! x k). 
Hence, in this case, Vy (K(qt*1 x k)) = qt? — q. 
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Proposition 3. If p= q?+q¢!+---+1, where q is a prime power, d a 
positive integer and k < q+1, then OAC A. ogbis) = g?(q¢+q?-14+---+1). 
Proof. The lower bounds for J, are implied by Proposition 1 and (2), 
respectively. The upper bounds are given by the following construction. Let 
X be the point set of a (d + 2)-dimensional projective space of order q, 
PG(d+ 2,q), let Z C X be a subspace of dimension d, and let Y!,..., Y¢+! 
be the hyperplanes containing 7,V° = Y° \ Z. The sets V“ partition X \ Z 
into g/+1-element classes. Choose a point c € V%+! and label the vertices of 
Vé = {uf :1 <i < q@"} in such a way that {vf : 1 < @< q}U{c} forma 
line for all 7. 


The hyperplanes not containing Z and avoiding c induce a perfect p-edge 
clique cover of K(p’+! x k) with classes V',...,V*(k < q). Indeed, PG(d + 
2,q) contains g¢+? 4+ q4+14...+q+1 hyperplanes and they cover each pair of 
points exactly p times. The point ¢ is contained by exactly g¢+!+---+q+1 of 
the hyperplanes, Z is contained in g+1 of them, and ZU {c} is contained in 
a unique one. So the above defined cover consists of q?+? — q sets. These sets 


still cover each pair of the form {vf, ui" },i# Jj exactly p times. However, the 


pairs of the form {v%, uv 


“, Uv} are uncovered, because any subspace containing 
these tow points must contain the line through them, so it must contain the 
element c. 

Similarly, considering all the hyperplanes not containing Z, we get a 


d+1 and 


p-edge clique cover of eee with classes V!,...,V*" where n = q 
k<qtl. 

The dual (perfect) p-representation of K(g?+! x k) can be obtained by 
considering a line, L, in the affine space of dimension d+ 2, and assigning all 
sets of the form Y \ {v‘} to the vertices of the ¢th color class, where v‘ € L, 
and Y is a hyperplane with YNL = {v‘}. Similarly, the dual p-representation 
of Rg on the underlying set X \ L can be obtained by assigning the sets 
Y \ {v‘} to the @th color class. There might be more optimal constructions 
using higher dimensional spaces. 


4. Constructions from Steiner Systems 


Proposition 4. Jf there exists a Hadamard matrix of size 4p, then there 
exists a perfect p-edge clique cover of K(2p x 2), so its 0, = 4p — 2. 


Proof. We are going to give a perfect p-intersection representation with 
underlying set {1,...,4p — 2}. Its dual is a perfect p-edge clique cover. Let 
M be a Hadamard matrix of order 4p. We may suppose that the last row 
contains only +1’s. The +1’s in any other row define a partition of {1,...,4p} 
into two 2p-element sets Pt U P~. We may also suppose that the last two 
entries are Mj 4p-1 = 1, Mi ap = —1 for 1 <7 < 2p. Finally, assign the set 
P+ \ {4p — 1} to the vertex v}, and P> \ {4p} to v?. 
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Note that both in Proposition 2 and here we have got the perfect p-edge 
clique cover from a resolvable S'\(sp,p,2) design, where s is an integer \ = 


(p — 1)/(s — 1). 


Proof of Theorem 2(b). First, we consider a perfect p-edge clique cover, F, 
in the case n = g?+! given by Proposition 2. Consider k identical copies of a 
Steiner system S(n,q?+!,2) over the n-element sets V’. Replace each block 
and its corresponding pairs by a copy of £. Then we obtain a system is a 
perfect p-edge clique cover. 

The proof of case (a) is similar, we put together a perfect p-edge clique 
cover using a building block of size 2p supplied Proposition 4 and a Steiner 
system S(n,2p,2). Taking the sets {vj,v?,...,v¥} p times, we get that 


8(Ke 2) < U,(K(n x k)) + pn. As 8,(K n) is a monotone function 


jacks 


of n we got Corollary 1 . 


Conjecture 1. If (n?—n)/p is an integer and n > no(p,k), then there exists 
a perfect p-edge clique cover of K(n x k), hence its p-intersection number 
Op = (n? —n)/p. 


The case p = 1 corresponds to the fact that there are transversal designs 
T(n,k) (i.e., mutually orthogonal Latin squares of sizes n) for n > no(k) 
(Chowla, Erdés, Straus [4], also see Wilson [23]). 

Chung and West [5] proved the case k = p = 2. They showed 2(K(n x 
2)) = (n? —n)/2 by constructing a perfect 2-edge cover (they call it a perfect 
2-generator) for the cases n = 1,2,5,7,10, or 11 (mod 12). This and (1) 
imply that 


0o(Knn) = (n? +3n)/2 (8) 


holds for these cases. Their conjecture about the so-called orthogonal double 
covers (a conjecture equivalent to the existence of a perfect 2-edge cover of 
K(n x 2)) which conjecture had appeared in [6], too, is true for all n > 8. 
This was proved by Ganter and Gronau [14] and independently by Bennett 
and Wu [1]. So no(2, 2) = 8 and (8) holds for all n > 8. 

There are two more values proved in [5], namely the special cases q = 2 
and q = 3 of the following conjecture. K(q? +q+1 x 2) has a perfect q-edge 
cover whenever a projective plane of order qg exists. This would imply that 
equality holds in (1) for (p,n) =(q+1,¢+q+1). 


5. Further Problems, Conjectures 


The first nontrivial lower bound for J,(K,,,,) was proved by Jacobson [16]. He 
and Kézdy and West [17] also investigated ¥2(G) for other classes of graphs, 
like paths and trees. 


jay 
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Estimate V, for complete bipartite graph with parts of sizes a and b when 
a —> oo,p is fixed and a/b goes to a finite limit. 

Another interesting graph where one can expect exact results is a 
cartesian product, its vertex set is I; x --- x Ip, and (i1,...,i¢) is joined 
to (t1,...,7,) if and only if i, #7, for alll <a<é. 

For a matching, M, of size n it easily follows that J,(M) = min{t : 
()) >n}. 

One can ask the typical value of J,(Q), i.e., the expected value of J, for 
the random graph of n vertices. The case of p = 1 was proposed in [13], and 
the best bounds are due to Frieze and Reed [12] (for an intermediate result 
see Bollobaés, Erdés, Spencer, and West [3]): For almost all graphs its edge 
set can be covered by O(n?/(logn)*) cliques and this magnitude is the best 
possible. Obviously, 


Dp(G) < WiG)+1<0(G)+p-1, 


so the order of magnitude of E(v,) is at most that of E(W1). Can we extend 
the method of [12] to estimate for E(V,)? 

The notion of J, was generalized from the study of the p-competition 
graphs. Another generalization, also having several unsolved questions, is the 
clique coverings by p rounds. Let G be a simple graph and let y,(G) be the 
minimum of }7,<;<, i Such that there are families Ai,...,A, with |Ai| = ni, 
such that each edge e € E(G) is covered by each family (i.e., there exists an 
A € A; with e c A), but this does not hold for the non-edges. It is known 
[13], that for all graphs on n vertices yg < 3n>/3 and for almost all graphs 
y2 > 0.1n4/3/(logn)*/3. For further problems and questions, see [13]. 

Bollobas [2] generalized the Erdés-Goodman-Posa result as follows. The 
edge set of every graph on n vertices can be decomposed into t(k —1,n) parts 
using only K;,’s and edges, where t(k—1, 7) is the maximum number of edges 
in a (k — 1)-colored graph on n vertices, e.g., t(2,n) = |n?/4|. There are 
many beautiful results and problems of this type, the interested reader can 
see the excellent survey by Pyber [21]. Most of the problems can be posed to 
multigraphs, obtaining new, interesting, non-trivial problems. 

Let J5(G) the minimum ¢ such that each vertex can be assigned a subset 
of {1,...,¢} in such a way that the intersection of any two of these sets is 
at most p, and vertices of G are adjacent if and only if the corresponding 
sets have exactly p common elements. Note that in all of the results in this 
paper were proved an upper bound for J}. If G” is a graph on n vertices with 
with 2n — 3 edges such that two vertices are connected to all others, then 
one can show that lim, 05 (G") — J,(G") = oo for any fixed p. What is 
max(J5(G) — Jp)(G)) and max(V5/V,) for different classes of graphs? 
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Summary. We consider some problems suggested by special cases of a conjecture 
of Erdés and Hajnal. 


1. Epsilons 


The problem I am going to comment on reached me in 1987 at Memphis in a 
letter of Uncle Paul. He wrote: ‘We have the following problem with Hajnal. 
If G(n) has n points and does not contain induced C4, is it true that it has 
either a clique or an independent set with n* points? Kind regards to your 
boss + colleagues, kisses to the ¢-s. E.P.’ After noting that ¢ has been used 
in different contexts I realized soon that ; is a good ¢ (in both senses since I 
have three daughters). About a month later Paul arrived and said he meant 
Cs for Cy. And this minor change of subscript gave a problem still unsolved. 
And this is just a special case of the general problem formulated in the next 
paragraph. 


2. The Erdés-Hajnal Problem (from [7]) 


Call a graph H-free if it does not contain induced subgraphs isomorphic 
to H. Complete graphs and their complements are called homogeneous sets. 
As usual, w(G) and a(G) denotes the order of a maximum clique and the 
order of a maximum independent set of G. It will be convenient to define 
hom(G) as the size of the largest homogeneous set of G, i.e. hom(G) = 
max{a(G),w(G)} and 
hom(n, M1, H2,...) = min{hom(G) : |V(G)| = n,G is H;-free} 

A well-known result of Paul Erdés [5] says that there are graphs of n 

vertices with hom(G) < 2logn (log is of base 2 here). The following problem 


of Erdés and Hajnal suggests that in the case of forbidden subgraphs hom(G) 
is much larger: Is it true, that for every graph H there exists a positive « and 
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ng such that every H-free graph on n > no vertices contains a homogeneous 
set of n* vertices? If such € exists for a particular H, one can define the ‘best’ 
exponent, ¢(H) for H as 


e(H) =sup{e > 0: hom(n, H) > n* for n > no}. 


The existence of e(H) is proved in [7] for Py-free graphs (usually called 
cographs but in [7] the term very simple graphs has been used). In fact, a 
stronger statement is proved in [7]: if e¢(H;) exists for 7 = 1,2 and H isa 
graph formed by putting all or no edges between vertex disjoint copies of Hy 
and Hy then ¢(H) also exists. Combining this with the well known fact that 
P,-free graphs are perfect [14], it follows that ¢(H) exists for those graphs 
HT which can be generated from the one-vertex graph and from P4, using the 
above operations. In the spirit of [7], call this class SVS (still very simple). 
In terms of graph replacements (see Sect.4 below), SVS is generated by 
replacements into two-vertex graphs starting from Ky and P,. As far as I 
know, the existence of ¢(H) is not known for any graph outside SVS. 


3. Large Perfect Subgraphs 


A possible approach to finding a large homogeneous set in a graph is to find 
a large perfect subgraph. It was shown in [7] that any H-free graph of n 
vertices has an induced cograph of at least e°(7 )vl08” vertices for sufficiently 
large n. This shows that the size of the largest homogeneous set makes a huge 
jump in the case of any forbidden subgraph. In a certain sense it is not so far 
from n°\#), What happens if cographs are replaced by other perfect graphs? 
A deep result of Prémel and Steger [13] says that almost all C5-free graphs 
are perfect (generalized split graphs) This suggest the possibility to find a 
large (n*) generalized split graph in a Cs-free graph of n vertices and prove 
the existence of ¢(Cs) this way. 


4. Replacements 


A well-known important concept in the theory of perfect graphs is the 
replacement of a vertex by a graph. The replacement of vertex x of a graph 
G by a graph H is the graph obtained from G by replacing x with a copy of 
HT and joining all vertices of this copy to all neighbors of x in G. According 
to a key lemma (Replacement Lemma) of Lovasz (see for example in [12]), 
perfectness is preserved by replacements. The property of being H-free is 
obviously preserved by replacements if (and in some sense only if) H can 
not be obtained from a smaller graph by a nontrivial (at least two-vertex) 
replacement. For such an H, replacements can be applied to get an upper 
bound on e(#). Analogues of the Replacement Lemma can be also useful to 
find large homogeneous sets (an example is Lemma 1 below) 
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5. Partitions into Homogeneous Sets 


For certain graphs H, the existence of e(H) follows from stronger properties. 
An H-free graph G may satisfy y(G) < p(w(G)) or 0(G) < p(a(G)) or 
more generally cc(G) < p(a(G),w(G)) where p is a polynomial of constant 
degree and x,0,a,w,cc denote the chromatic number, clique cover number, 
independence number, clique number and cochromatic number of graphs. 
Using terminology from [10], p is a polynomial binding function (for x, 9, cc, 
respectively). It is clear that if H-free graphs have a polynomial binding 
function of degree k then e(H) > ee Binding functions for y (for @) in 
H-free graphs may exist only if H(H) is acyclic. However, the existence of 
a polynomial binding function for cc in H-free graphs is equivalent with the 
existence of e(H). 


6. Small Forbidden Subgraphs 


The existence of ¢(H) follows if H has at most four vertices since these 
graphs are all in SVS. But, as will be shown below, to find ¢(H) for these 
small graphs is not always that simple... 

Since hom(n, H) = hom(n, H) from the definition, it is enough to consider 
one graph from each complementary pair. For H = K,,, finding hom(n, H) 
is the classical Ramsey problem. In case of m = 2,3,4,¢(K2) = 1 (trivial), 
e(K3) = 5 (from Uncle Paul’s lower bound on R(3,m) in [4]), 4 < e(Ka) < 
0.4 (the upper bound is due to Spencer [15]). If H = P; or H = P, then an 
H-free graph is perfect and thus e(H) = $. 

There are four more graphs with four vertices to look at. Let Hy be K1,3, 
the claw, and let Hz be K3 with a pendant edge. It is not difficult to see 
that ¢(H;) = } in this case. The construction is simple: let G be a graph on 
m vertices with no independent set of three vertices and with no complete 
subgraph of much more than \/m vertices [4]. Take about vm disjoint copies 
of G. This graph is H,-free, has m2 / 2 vertices and has no homogeneous 
subset with much more than m? vertices. The complement of this graph 
is good for Hz. On the the other hand, let G be an H;-free graph with n 
vertices (i is 1 or 2). If the degree of a vertex v is at least n3 then I'(v) 
(the set of vertices adjacent to v) contains a homogeneous set of at least 
n3 vertices (in the case of H; by Ramsey’s theorem, in the case of Hz by 
perfectness). Otherwise G has an independent set of at least n3 vertices. 
This gives 


Proposition 1. If H is the claw or K3 with a pendant edge then e(H) = $. 


The remaining two H-s are the Cy and K4 minus an edge (the diamond). 
The following argument is clearly discovered by many of us, could be heard 
from Uncle Paul too. It was used for example in [9, 16]. Let S = {v1,...,va} 
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be a maximum independent set of a C4-free or diamond-free graph G. Then 
V(G) is covered by the following | sets: A(i), BUi,j), 1<i<j<a 
where 


A(t) = {vE V(G)—S:T(w)nS = {u,}} U {u;} 
and 
Bii,j) = {ve V(G)-—S:TWw)NS 2D {ui, vf}. 


The sets A(z) induce complete subgraphs by the maximality of S and the 
sets B(i, 7) induce homogeneous sets (complete if G is C4-free, independent 
if G is diamond-free). This gives 


Proposition 2. If G is C4-free or diamond-free then cc < ( 


Corollary 1. If H is either Cy or the diamond then hom(n,H) > (2n)3. 
Therefore e(H) > ; 


Vertex disjoint unions of complete graphs shows that e(H) < $ for any 
connected graph H. In the case of H = Cy this upper bound can be improved 
as follows. Let R(C4,m) be the smallest integer k such that any graph on 
k vertices either contains a Cy (not necessarily induced C4!) or contains an 
independent set on m vertices. F. K. Chung gives a graph G,,, in [3] which 
shows that R(C4,m) > m3 for infinitely many m. Replacing each vertex of 

m 


Gm by a clique of size 4 we have a graph with no induced C4 and with no 


homogeneous subset larger than m. This gives 
Proposition 3. <(C4) < 2. 


Notice that if R(C1,m) > m?~* with every ¢ > 0 as asked by Uncle Paul 
then the replacement described above would show that ¢(C4) = $. 

Perhaps the next construction has a chance to improve the upper bound 
on ¢(H) if H is the diamond. The vertices of G, are the points of a linear 
complex [11] of a 3-dimensional projective space of order g. Two points are 
adjacent if and only if they are on a line of the linear complex. The graph 
G, has q? + q? +q+1 vertices, w(G,) = q+ 1 and G, is diamond-free. 
But is it true that a(G,) < qz-e for some positive e« and for infinitely 
many g? Thanks for the conversations to T. Szényi who thinks this is 
not known. 


Problem 1. Improve the exponents in the above estimates of hom(n, H) if 
HA is either C4 or the diamond. 


What happens if G is Cy-free and diamond-free? In these graphs each 
four cycle induces a K4, The sets B(i, 7) collapse implying 


Corollary 2. hom(n, C4, Diamond) > \/ gn —1. 


Problem 2. Is it true that hom(n, C4, Diamond) = \/n + 0(./n) ? 
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During the years between the submission and publication of this paper, 
Problem 2 had been answered affirmatively, in fact hom(n, Cy, Diamond) = 
[Vn] (6) 

There are eight five-vertex graphs outside SVS. Keeping one from each 
complementary pair reduces the eight to five: Ky3 with a subdivided edge, 
the bull (the self-complementary graph different from C5), Cy with a pendant 
edge, Ps, and Cs. The existence of ¢(H) is open for all of them, perhaps 
the list is about in the order of increasing difficulty. The construction in 
Proposition 1 shows that <(H) is at most 4 for all but Cs. In the case of Cs 

log 3 


repeated replacements of C7UK3 into itself shows e(Cs) < lord: (Any C5-free 


graph G with hom(G) = 3 and with at least 11 vertices would improve this.) 


7. Forbidden Complementary Pairs 

Perhaps an interesting subproblem is to find bounds on hom(n, H, H). In the 
case of four-vertex H, the structure of graphs which are both H-free and 
H-free is well understood and values of hom(n, H, H) can be determined as 
follows: n? if H = Py, (from perfectness); n2 — 1 if H = P3+ K, (from 
structure, [10]); "5+ if H = C4 (from structure, [2]); n—4 if H is a diamond 
(from structure, [10]); = if H is a claw (from structure, [10]). 

The rest of this section is devoted to the case H = Ps. The upper bound 
hom(n, Ps, Ps) < nless is shown by replacing repeatedly Cs into itself. The 
lower bound n3 will follow from Corollary 3 which is the consequence of the 
following result. 


Theorem 1. If G is Ps-free and Ps-free then G satisfies the following 
properly SP*: there is an induced perfect subgraph of G whose vertices 
intersect all maximal cliques of G. 


Notice that property SP* is a generalization of strong perfectness 
introduced by Berge and Duchet in [1]. (Maximal clique is a clique which 
is not properly contained in any other clique.) By Theorem 1, if G is both 
P;-free and Ps-free then G can be partitioned into at most w(G) vertex 
disjoint perfect subgraphs. Each of these perfect graphs has clique number 
at most w(G) thus each has chromatic number at most w(G). This gives the 
next corollary. 


Corollary 3. If a graph is both Ps-free and Ps free then y <w”. 


The proof of Theorem 1 is combining a result of Fouquet [8] with the 
following analogue of the Lovasz replacement lemma. 


Lemma 1. Property SP* is preserved by replacements. 


Proof. The proof of the Lemma is along the same line as the replacement 
Lemma of Lovadsz. Assume that G and H have property SP* and R is the 
graph obtained by replacing v € V(G) by H. Let G; and Hy; be perfect 
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subgraphs of G and H such that V(G) intersects all maximal cliques of G 
and V(H;) intersects all maximal cliques of H. 


Case 1. uv €¢V(G1). We claim that V(G1) intersects all maximal cliques of 
R. Let K be a maximal clique of R. If V(kK) N V(#) is empty then the 
claim follows from the definition of G;. Otherwise {v} U(K NV(G)) isa 
clique of G which can be extended in G' to a maximal clique K’ intersecting 
V(G}). Since K is obtained by replacing v € K’ by KNV (HA), K intersects 
V(G4). 

Case 2. v€V(G1). By the Lovadsz replacement lemma, the subgraph Z of 
R induced by (V(G1) U V(Ai)) — {v} is perfect. If a maximal clique K 
of R intersects V(H), it intersects it in a maximal clique of H which (by 
the definition of H,) intersects V(Hj). If kK does not intersect H then it 
does not contain v so it intersects V(Gi) — {v} by the definition of G1. 
Therefore K intersects Z. 


Theorem 2 (Fouquet [8]). Each graph from the family of Ps-free and Ps- 
free graphs is either perfect or isomorphic to Cs or can be obtained by a 
nontrivial replacement from the family. 


Now Theorem 1 follows by induction from Lemma 1 and Theorem 2. 


8. Berge Graphs 


These are graphs which do not contain induced subgraphs isomorphic to 
Cor41 or to Cox41 for k > 2. According to the Strong Perfect Graph 
Conjecture (of Berge), Berge graphs are perfect. The following weaker form 
of this conjecture is attributed to Lovasz in [7] (illustrating the difficulty of 
proving the existence of e(Cs)). 


Problem 3. There exists a positive constant e such that Berge graphs with 
n vertices contain homogeneous subsets of n= vertices. 


Similar problems can be asked for subfamilies of Berge graphs for which 
the validity of SPGC is not known. One of them is the following. 


Problem 4. Show that C4-free Berge graphs with n vertices contain homo- 
1 1 : 
geneous subsets of n2 (or at least cn?) vertices. 


9. Notes Added in 2013 


Roughly 15 years went by... Replacements suggested in Sect.4 have been 
developed in [17], implying the existence of <¢(H) for many new graphs H. 
Two of them, together with the bull resolved separately in [19], leave only 
two five-vertex graphs (from the five in Sect.6) for which the Erdés-Hajnal 
conjecture is open: Ps (or its complement) and C;. The theme of Sect. 7 
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is 


recently revitalized, see [20, 21]. Finally, with the Strong Perfect Graph 


Theorem [18], the problems of Sect. 8 are resolved. 
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Summary. A two-packing of a graph G is a bijection 0 : V(G) +» V(G) such 
that for every two adjacent vertices a, b € V(G) the vertices o(a) and o(b) are not 
adjacent. It is known [2, 6] that every forest G which is not a star has a two-packing 
o. If FG is the graph whose vertices are the vertices of G and in which two vertices 
a, b are adjacent if and only if a, b or o~'(a), g~'(b) are adjacent in G then it is 
easy to see that the chromatic number of F% is either 1, 2, 3 or 4. We characterize, 
for each number n between one and four all forests F’ which have a two-packing o 
such that F, has chromatic number n. 


Keywords Packing, Placement, Factorization, Tree, Forest, Chromatic num- 
ber 


AMS Subject Classification: 05C70 


1. Introduction 


We only discuss finite simple graphs and use standard terminology and 
notation from [1] except as indicated. For any graph G, we use V(G) and 
E(G) to denote the set of vertices and the set of edges of G, respectively. 
A forest is a graph without cycles. A tree is a connected forest. The length 
of a path or cycle is the number of its edges. A path of length n is denoted 
by P,, and a cycle of length n is denoted by C;,. The complete graph on n 
vertices is denoted by K,. A vertex of a graph G is an isolated point if its 
degree is zero and it is an endpoint of G if its degree is one. A vertex of a 
forest F' adjacent to an endpoint of F is called a node of F’. The set of vertices 
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of a forest fF which are adjacent to at least one of the endpoints of a set A 
of endpoints is called the set of nodes belonging to the set A of endpoints. 

The distance between two vertices x and y of a graph G is the length 
of the shortest path in G from x to y. The diameter of a graph G is the 
largest distance between any two vertices of G. Observe that if Tis a tree 
and the distance between the vertices x and y of T is the diameter of T, then 
the vertices « and y are endpoints of T. A vertex x of a tree T is in the center 
of T, or is a central point of T, if the maximal distance from x to any other 
vertex of T is minimal for the vertex x. Observe that if the diameter of T' is 
odd then T has exactly two central points and if the diameter of T is even, 
then JT has exactly one central point. 

A tree of order at least two has at least two endpoints and if it has exactly 
two endpoints it is a path. A tree with exactly two nodes is called a dragon. 
Note that every tree of diameter three is a dragon. A tree of diameter two or 
one with n > 1 edges is called a star S,,. Hence an isolated point is not a star. 
If A is a set of vertices of the graph G then G—A is the graph obtained from G 
by removing the vertices in A together with all of the incident edges from G. 
We will write G—a for G— {a}. If F is a forest and A a set of endpoints 
of F' then the forest F' — A is called a derived forest of F’. If A is the set of 
all endpoints of F then F' — A is the completely derived forest of F’. A tree 
T is a crested dragon if T has a path P = aj, a2,a3,a4 of length three as a 
derived tree, the vertices a,, a4 and possibly a3 are nodes of T', every node 
of T is in the set {a1,a3,a4} and the number of endpoints adjacent to a, is 
strictly larger than the number of endpoints adjacent to a3. Observe that the 
crested dragon T is a dragon of diameter five if and only if a3 is not a node 
of T. See Fig. 1. 


dragon of diameter five crested dragon 


Fig. 1 


Let G be a graph and o a bijection from V(G) to V(G). If o has the 
property that for every pair a and 6 of adjacent vertices of G the vertices 
o(a) and o(b) are not adjacent then o is called a two-packing of the graph 
G. If o is a two-packing of G then the graph o(G) has the same vertices as 
the graph G and two vertices a and b of o(G) are adjacent in o(G) if and 
only if the vertices o~'(a) and o~1+(b) are adjacent in the graph G. If E is 
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the set of edges of G then o(F) denotes the set of edges of o(G). Note that 
EQ o(E£) =@. The graph G, has the same vertices as the graph G and the 
edges of G, are given by E(G,) = E(G) U E(a(G)). The graph G, can be 
factorized into the graphs G and o(G). Hence results about two-packings of 
graphs can also be understood as results about factorizations of graphs. 

Burns and Schuster proved that if a graph G of order n does not contain a 
vertex of degree n—1 and contains no cycles of length 3 or 4 and |£(G)| = n—1 
then there is a two-packing of G, [2]. Faudre, Rousseau, Schelp and Schuster 
proved that if a graph G of order n does not contain a vertex of degree n — 1 
and contains no cycles of length 3 or 4 and |E(G)| < 6n/5 — 2, then there is 
a two packing of G, [3]. St. Brandt proved that if G is a non-star graph of 
girth larger than or equal to seven then there is a packing of two copies of G, 
[4]. It follows from [2] that every tree which is not a star has a two-packing 
and we recently [6] characterised those trees which have a “three-packing”. 

The results mentioned above establish the existence of two-packings of 
a graph G but are not concerned with any properties of the graph G,. 
From the point of factorizations of graphs this would be a very interesting 
question. Given the difficulties in establishing the existence of two-packings 
we looked for a simple case in which we might be able to determine the 
possible chromatic numbers of the two-packings of a graph. Note first that 
for any two-packing o of G the chromatic number of G, satisfies the inequality 
x(G) < x(G.) < y(G)?. This means that if F is a forest which has at least 
one edge and @ is a two-packing of F then 2 < y(F,) < 4. If F, has chromatic 
number less than or equal to two we will call the two-packing o a bipartite 
two-packing. 

If G is a connected bipartite graph then the partition of G into the two 
color classes is unique. We will call the color class of a vertex x the parity of 
x and will often call the vertices in one class the vertices of even parity or 
even vertices and the vertices in the other class the vertices of odd parity or 
the odd vertices. We will then take care to label the even vertices with even 
and the odd vertices with odd numbers. A consequence of the uniqueness of 
the color-classes of a connected bipartite graph G is, that if o is a bipartite 
two-packing of G then either for every vertex x of G, o(a) has the same 
parity as x or for every vertex x of G, a(x) has a different parity than x. In 
the first case we will say that the bipartite two-packing o is an equal parity 
packing and in the second case we will say that the bipartite two-packing o 
is an unequal parity packing. If F’ is a forest and o a two-packing of F’ then 
o is an equal parity two-packing of F if for every vertex a of F', whenever 
o(a) and a are in the same connected component of F’ then, in this connected 
component of F’, the parity of o(a) is the same as the parity of a. 

It will be often necessary to exhibit a two-packing a for some small forest 
F, We usually will do this in a figure in which the edges of F are solid lines 
and the edges of o(F) are interrupted lines. In order to check the claim that 
the exhibited map a is indeed a two-packing one has to check that the graph 
with the interrupted edges is indeed isomorphic to the graph having solid 


146 Hong Wang and Norbert Sauer 


edges and that the graph with the solid edges is the intended graph. It is 
then visually clear that the two edge-sets have no edge in common. In the 
case of a bipartite two-packing o, if in the text a vertex is described by a 
labeled letter then in the figure the label alone will be used to name the 
vertex. The labels of the vertices of F' will be in bold times italic typeset and 
the vertices of o(F’) in gray courier oblique. All vertices of one parity will 
have even and all the other vertices will have odd labels. In order to check 
that o is an equal parity two-packing it suffices to check that the parity of 
the two labels of each vertex is the same. 

We will, for each number n between one and four, completely characterize 
all forests F which have a two-packing a such that F, has chromatic 
number n. More exactly we will prove the following theorems: 


Theorem 1. A forest has a bipartite two-packing if and only if it is either 
a singleton verter or disconnected or a tree of diameter at least five which is 
not a crested dragon. Every forest which has a bipartite two-packing has an 
equal parity two-packing. 


Theorem 2. A forest F has a two-packing o such that F has chromatic 
number three if and only if either F is a tree with diameter at least four or 
F is disconnected and contains a path of length two. 


Theorem 3. A forest F has a two-packing a such that F, has chromatic 
number four if and only if F contains a path of length three and F' does not 
consist of exactly two connected components where one of the two components 
is a path of length three and the other is a star. 


2. Preliminary Results 


If F is a forest, B a set of vertices of F and o a two-packing with the property 
that for all b € B, o(b) ¥ b holds, then we say that o splits the vertices of 
B. If B consists of a single vertex b we say that the two-packing o splits the 
vertex b. 

If o is an equal parity two-packing which splits all vertices of V(F’) then 
we say that o is a perfect packing. Let A be a set of endpoints of a forest G 
and B the set of nodes which belong to A. Assume that o is a two-packing 
of G — A which splits the vertices of B and 4 the extension of o from G— A 
to F such that for all a € A, \(a) = a holds. Obviously \ is a two-packing of 
G. We will call \ a trivial extension of o to the forest G. The following First 
Preservation Lemma holds for trivial extensions of two-packings. 


Lemma 1 (First Preservation Lemma). Let o be a two-packing of a 
forest F and X a trivial extension of o to the forest G. If F, has chromatic 
number four then Gy has chromatic number four. If F, has chromatic number 
three then Gy has chromatic number three. If o is an equal parity two-packing 
then is an equal parity two-packing. 
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Proof. Assume that A is the set of endpoints of G such that G— A = F 
and that B is the set of nodes of G which belong to the set A of endpoints. 
Then by the definition of trivial extension, the two-packing o splits the nodes 
in B. The chromatic number of G) is larger than or equal to the chromatic 
number of F,, and because the chromatic number of any two-packing of a 
forest is at most four, the first of the assertions follows. The endpoints in A 
will have degree two in Gy, which implies the second assertion. Let us now 
assume that o is an equal parity two-packing. Then, for every node b € B 
and every two coloring of F’,, the vertices b and o(b) will be colored with the 
same color. This of course implies the third assertion. 


Corollary 1. If some derived forest of a forest F has a perfect packing then 
F itself has an equal parity two-packing. 


Let F' be a forest, a and b two endpoints of F’, a; the node adjacent to a 
and b; the node adjacent to b. If a, and 61, are different vertices then the pair 
of endpoints a and 0 is called a reducing pair of endpoints of the forest F’. If 
the nodes a; and 6; are in the same connected component of F and have the 
same parity then clearly also the endpoints a and b have the same parity and 
then a and 0 is an equal parity reducing pair of endpoints of the forest F’. 
Let a, b be a reducing pair of endpoints of the forest F’, a, and ag the nodes 
adjacent to a and 6 respectively and o a two-packing of the forest F' — {a, b}. 
Let be the extension of o from F — {a,b} to F' such that if o(a,) = a; or 
o(b,) = 6; then A(a) = b and X(b) = a. In all other cases we put A(a) = a 
and A(b) = b. Clearly 4 is a two-packing of the forest F’. We will call the 
extension of o to the reducing pair a,b of endpoints. The following Second 
Preservation Lemma holds for extensions of two-packings to reducing pairs 
of endpoints. 


Lemma 2 (Second Preservation Lemma). Let F' be a forest, a, b a 
reducing pair of endpoints of F', o a two-packing of the forest F — {a,b} 
and \ an extension of o to the reducing pair a, b of endpoints of F. If Fs 
has chromatic number four then Fy) has chromatic number four. If Fz has 
chromatic number three then Fy has chromatic number three. If o 1s an equal 
parity two-packing and a, and b, have the same parity then X is an equal 
parity two-packing. If o splits a set B of vertices then splits the set B of 
vertices. 


Proof. The chromatic number of F) is larger than or equal to the chromatic 
number of F,, and because the chromatic number of any two-packing of a 
forest is at most four, the first of the assertions follows. The endpoints a 
and 6b will have degree two in F\ which implies the second assertion. Let us 
now assume that o is an equal parity two-packing and that the nodes a, 
and b; have the same parity. Then, for every proper two coloring of F,, the 
vertices a1, 61, o(a,) and o(b;) will be colored with the same color. This of 
course implies the third assertion. Because o and X agree on F' — {a,b} the 
two-packing A splits all vertices which are split by a. 
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Observe the following fact: 


Lemma 3. If T is a tree with diameter one or two then T does not have a 
two-packing. 


Lemma 4. For every two-packing o of a tree T with diameter three the graph 
Ts has chromatic number four. 


Proof. Denote the two central points of T by a and b. If o(a) = a then o(b) 
is an endpoint not adjacent to a and hence adjacent to b. We will show that 
o(b) and 6 are also adjacent in o(T), a contradiction. Because ¢ is an onto 
map and o maps all vertices adjacent to a to endpoints adjacent to b there 
is some endpoint x adjacent to 6 such that o(#) = b. But then o(z) = 6b is 
adjacent to o(b) in o(T). If o(a) = 6 then o(b) is an endpoint not adjacent 
to b and hence adjacent to a. We will show that o(b) and a are also adjacent 
in o(T), a contradiction. Because o is an onto map and o maps all vertices 
adjacent to a to endpoints adjacent to a there is some endpoint x adjacent 
to 6 such that o(a) = a. But then a(x) = a is adjacent to o(b) in o(T). We 
conclude that {a(a),o(b)} {a,b} = 0. 

We will now prove that TU o(T) induces the complete graph on the set 
{o(a),a(b), a,b} of four vertices. It follows from the result of the previous 
paragraph that there are two endpoints x and y of T such that o(a) = x 
and o(b) = y. Again because o is onto there are two endpoints u and v 
of T such that o(u) = a and o(v) = b. The graph T U o(T) induces then 
six different edges in the set {o(a),a(b), a,b} of vertices. Those are first the 
two edges from a to 6 and o(a) to o(b). Then each of the four different 
points x, y which are endpoints of T and o(u), o(v) which are endpoints 
of o(T) is in the graph T U o(T) adjacent to at least one of the vertices 


{a(a),o(b), a, bd}. 


Lemma 5. If T is a tree with diameter at least two and at most four then 
T does not have a bipartite two-packing. 


Proof. Observe that every tree T of diameter at most four contains a vertex 
a which is adjacent to every vertex of T’ whose parity is different than the 
parity of the vertex a. If T has a bipartite two-packing o then because @ is 
a bijection, there is some vertex b of T such that o(b) = a. This means that 
the parity of a is different from the parity of b, otherwise b would have to be 
an isolated point (and if the diameter of a tree is larger than one it does not 
contain an isolated point). Because the vertex a is adjacent to every vertex 
whose parity is different from the parity of a, the vertices a and b are adjacent 
in T and therefore o(a) and o(b) are adjacent in o(T). Remember that every 
bipartite two-packing of T either changes the parity of all vertices or maps 
each parity class into itself. Hence o(a) is a vertex c adjacent to a. This now 
leads to a contradiction. The vertices a and c are adjacent in T’, and because 
a =o(b) and c= o(a) they are adjacent in o(T). 
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3. Bipartite Two-Packings 


Lemma 6. Every path of length at least seven has a perfect packing. 


Proof. If the length r of the path is seven, eight or nine Lemma 6 follows from 
Figs. 2, 3, and 4. So assume that P,.1 = ao, @1,...,@,41 is a path with r > 9 
and o is a perfect packing of the path P, = ao, a1,...,a,. Assume that for x 


150 Hong Wang and Norbert Sauer 


in P,, o(a) = a,. There are at least five vertices in the path P, which have 
the same parity as the vertex a,+ . This means that at least one of them, say 
a; with o(y) = a; has the following properties: aj, 4 o(a,) # aj) and y 
is not adjacent to x. The following bijection \ from (P,41) to (P,+1) is then 
a perfect packing of P,+1. First of all A agrees with o on (P,_,) and then 
A(y) = Gr41 and A(a;+41) = a;. It is easy to check that is a perfect packing 
of Prt. 


Lemma 7. If a forest G has an equal parity two-packing which splits the 
vertices of some set B, then the forest F which consists of G together with an 
additional isolated point e has an equal parity two-packing which splits every 
vertex in B and which also splits the isolated point e. 


Proof. Let o be an equal parity two-packing of G which splits every vertex in 
B. Let a be any vertex of G and X a function from V(G) U {e} to V(G) U {e} 
which agrees with o on all vertices of V(G) — a and for which A(e) = o(a) 
and A(a) = e. Clearly 4 is an equal parity two-packing of G which splits the 
vertices of BU {e}. 


Lemma 8. Every forest F' with two components where one of the compo- 
nents is an isolated point e and the other component T is a dragon, a star, 
or an isolated point, has an equal parity two-packing which splits the isolated 
point e. 


Proof. If T is an isolated point then F' clearly has an equal parity two- 
packing which splits e. If TJ is a star then a derived forest of F’ consists 
of two isolated vertices and hence there is a derived forest of F’ which has 
a perfect packing. Lemma 8 follows from the Corollary to Lemma 1. Let 
the path P, = a1,a2,...,@, be the completely derived tree of T and denote 
the nonempty set of endpoints adjacent to a, by A and the nonempty set 
of endpoints adjacent to a; by B. Let a € A and b € B be two endpoints 
of T. The path P, together with a and 6 is then a path P of length r + 1. 
If the forest whose connected components are P, and e has an equal parity 
two-packing in which the vertices a,, a, and e are split, then according to 
the First Preservation Lemma, the forest F' has an equal parity two-packing 
which splits e. It follows from Lemmas 6 and 7 that the forest whose connected 
components are P and e has an equal parity two-packing in which the vertices 
a,,a, and e are split if r > 6. If r = 3 or r= 5 then P, is a star Sp or has 
an equal parity reducing pair of endpoints a and b such that P, — {a,b} isa 
star Sp. Using the Second Preservation Lemma we see that Lemma 8 holds 
in this case. This leaves the cases r = 2 and r = 4. If r = 2 Fig.5 exhibits 
an equal parity two-packing of P together with an isolated point which splits 
the two nodes of P and the isolated point. 

Consider r = 4. Let ap € A, as € B. See Fig.6 in which 1, 4 and e are 
split. Then by the First Preservation Lemma, Lemma 8 holds for this case as 
well. 
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Lemma 9. Every forest F with two components, where one of the compo- 
nents is an isolated vertex e and the other a tree T, has an equal parity 
two-packing which splits e. 


Proof. By the Second Preservation Lemma we may assume that T does not 
contain two different nodes which have the same parity. This means that T 
can not have three different nodes, hence T is a dragon, a star or an isolated 
point. Lemma 9 follows then from Lemma 8. 


Lemma 10. Every forest F with at least two connected components has an 
equal parity two-packing. 


Proof. Observe first that if we obtain the forest G from F' by adding an 
edge which is adjacent to vertices in different connected components of F, 
and if G has an equal parity two-packing, then F’ has an equal parity two- 
packing. We may therefore assume without loss of generality that the forest 
F has exactly two components G and H. Let G, be the forest which consists 
of G together with an additional vertex e; and H, the forest which consists 
of H together with an additional vertex ez. Let y be an equal parity two- 
packing of G; which splits e; and \ be an equal parity two-packing of Hy 
which splits e2. Identify the vertex y(e1) with the vertex e2 and the vertex 
A(e2) with the vertex e,. It is not difficult to check that yUA = a is an equal 
parity two-packing of the forest F’. 


Lemma 11. A crested dragon T does not have a bipartite two-packing. 


Proof. Let P = a1,a2,a3,a4 be the completely derived tree of the crested 
dragon T’. Assume that the nodes of T are a1, a4 and possibly a3. Let A be 
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the set of endpoints adjacent to a,, B the set of endpoints adjacent to a4 
and C the set of endpoints adjacent to a3. Then |A| > |C| and C might be 
empty if T is a dragon of diameter five. Assume for a contradiction that o is 
a bipartite two packing of T’. 


Case 1. The two-packing o is an equal parity two-packing. 


Every even vertex of T is adjacent to at least one of the odd vertices a1 
and a3. If the distance in T between o(a,) and o(a3) is two, then for some 
even vertex x of T, a(x) would be in T adjacent to both vertices o(a,) and 
a(a3) and in o(T) to at least one of the vertices o(a,) and o(a3). This is not 
possible because T and o(T) would then have an edge in common. Hence 
the distance between o(a1) and o(az) is larger than two and because this 
distance must be an even number it is at least four. Also both vertices o(a1) 
and o(a3) are odd vertices of T. Hence one of the vertices o(a;) or o(a3) 
must be equal to a;. If a(a,) = a; then the |A| + 1 even vertices adjacent to 
a, must be mapped by o to even vertices not adjacent to a;. But there are 
only |C| +1 < |A|+1 even vertices not adjacent to a1. If c(a3) = a; then the 
|C| + 2 even vertices adjacent to ag must be mapped by o to even vertices 
not adjacent to a ;. But there are only |C| +1 < |C|+ 2 even vertices not 
adjacent to ay. 


Case 2. The two packing o is an unequal parity two-packing. 


The vertex a4 is adjacent to all odd vertices except a1. Hence o(a4) must 
be an odd endpoint of T,, that is o(a4) is some endpoint adjacent to a4 and 
a(a,) = a4. But this implies that o(a,) can only be adjacent to a, in o(T). 
Hence a, must be an endpoint of T, a contradiction. 


Lemma 12. [fT is a tree of diameter five which is not a crested dragon but 
every equal parity reduction of T is a crested dragon or a tree of diameter 
less than five then T has an equal parity two-packing. 


Proof. We assume first that T contains two equal parity reducing endpoints x 
and y such that T — {, y} is a crested dragon D. Let m be the node adjacent 
to z and n the node adjacent to y. Choose a path P = ag, a1, a2, a3, 44, a5 in 
the crested dragon D such that the vertices a1, a4 and possibly a3 are nodes 
of D but no other vertices are nodes of D. Note that P is a derived tree of T. 
Denote by A the set of endpoints adjacent to a,, by B the set of endpoints 
adjacent to a4 and by C the set of endpoints adjacent to a3. Then |A| > |C| 
and C' might be empty. Observe that the nodes m and n are vertices of D. 
We are going to discuss several cases depending on which vertices of D are 
the nodes m and n of T. Clearly {m,n}N (AUB) = 0 otherwise the diameter 
of T would be larger than five. 

We assume first that both nodes m and n are elements of C’. Let then R 
be the subtree of T’ which is spanned by the vertices of P together with the 
vertices m, n, x and y. Clearly R is a derived tree of T. Figure 7 exhibits a 


The Chromatic Number of the Two-Packing of a Forest 153 


perfect packing of R. It follows then from the First Preservation Lemma that 
T has an equal parity two-packing. (The two-packing o maps a bold labelled 
vertex 7 to the gray labeled vertex i. The reason for using numbers and not 
letters in Fig. 7 is to make it obvious that the graph F, is bipartite. Indexed 
letters take too much space in the figure.) 


Fig. 7 


If only one of the nodes m and n, say m, is an element of C’, then the 
other node n must be equal to some vertex of even parity in D. Hence n 
is equal to ag or ag. If nm = ag let R then be the subtree of T which is 
spanned by the vertices of P together with the vertices m and y. Clearly R is 
a derived tree of T’. Figure 8 exhibits a perfect packing of R. It follows from the 
First Preservation Lemma that T has an equal parity two-packing. If m is an 
element of Cand n = a, we consider R to be the subtree of T which is spanned 
by the vertices of P and the vertices m and x. Note that a2 is now not a node 
of T’. Clearly R is a derived tree of T. Figure 9 exhibits a bipartite two-packing 
of R in which every vertex splits except the vertex a2, which corresponds 
to 2 in the figure. It follows then from the First Preservation Lemma that 
T has an equal parity two-packing. We arrived now at the situation that 
neither m nor n is an element of C’. Because m and n have the same parity 
it follows that either {m,n} = {a1,a3} or that {m,n} = {a2,a4} holds. If 
{m,n} = {a1,a3} then T is a crested dragon, contrary to our assumptions on 
T. (Because |A| > |C| clearly |A] +1 > |C| +1). If {m,n} = {a2, a4} and C 
is not empty then the same graph R as used for Fig. 8 is a derived tree of T. 
If C is empty then T is a crested dragon, (label the path P beginning at as 
with a9), contrary to the assumption on T. 

We can now assume that whenever z and y is an equal parity reducing pair 
of endpoints of T’,, then the diameter of the tree K = T—{x, y} is smaller than 
five. Because T has diameter five it contains a path P = ao, a1, a2, a3, a4, a5 
of length five such that the vertices aj and as are endpoints of T and a, and 
a4 are nodes of T’. Since T is not a crested dragon, the tree T' has at least one 
node different from a; and a4. The tree T has at most two more nodes besides 
a, and a4, otherwise two of those nodes would have the same parity and T 
could be further equal parity reduced to a tree which has diameter five. If T 
has two nodes besides a; and a4 we will denote them by n and m and if T has 
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only one such a node we will denote it by m. Note that the two nodes m and 
n have different parity otherwise T’ could be equal parity reduced to a tree 
which has diameter five. For a vertex a in T the distance of a from P is the 
length of the shortest path from a to any vertex of P. The vertex of P which 
has shortest distance from a is called the point of attachment of a. Because 
T has diameter five there is no vertex in T’ which has distance greater than 
or equal to three from P. This means that the nodes m and n have distance 
at most one from P. 

If both nodes m and n have distance one from P the points of attachment 
of m and n are in the set {a2,a3}. Otherwise T would have diameter at 
least six. If the points of attachment of m and n are different we may 
assume without loss of generality that the point of attachment of m is a2 
and the point of attachment of n is a3. If we remove all endpoints from T 
except the endpoints ao and as we arrive at the subtree of T spanned by 
the vertices ao, @1, 42, 43,44,45,m,n. We see from Fig.8 that this tree has 
a perfect packing. Hence by the First Preservation Lemma, T’ has an equal 
parity two-packing. If m and n have the same vertex as point of attachment 
then m and n would have the same parity. 

Hence at most one node, say m, has distance one from P while the other 
node is a vertex in P. As above the point of attachment of m is one of the 
points a2 or a3 and we may assume without loss of generality that ag is 
the point of attachment of the node m. The node n can not be one of the 


The Chromatic Number of the Two-Packing of a Forest 155 


endpoints of JT adjacent to a; or a4 because the diameter of T' is five. By 
assumption n is not equal to a; or a4. If n is equal to az then m and n would 
have the same parity. We conclude that n = a3 holds. The nodes a3 = n and 
a, of T have the same parity. If a1 would be adjacent to more than the one 
endpoint ao then T could be further equal parity reduced while still containing 
a path of length five. Hence a, is only adjacent to the one endpoint ag. If we 
remove all endpoints from T except the endpoints ap and as we arrive at the 
subtree of T spanned by the vertices ao, a1, a2,a3 = N,a4,a5,m. We see from 
Fig. 10 that this tree has an equal parity two-packing in which every vertex, 
except the vertex a1, splits. Hence by the First Preservation Lemma the tree 
T has an equal parity two-packing. 

The next case to investigate is the one in which both of the nodes m and 
n of T are vertices of the path P. Because the diameter of T is five the nodes 
m and n can not be endpoints of P and by assumption m and n are different 
from a; and a4. Also, m and n are different from each other. Hence without 
loss, m = ag and n = a3. There is then at least one endpoint of T adjacent to 
az and at least one endpoint of T adjacent to a3. We can now use Fig. 8 and 
the First Preservation Lemma to conclude that T has a bipartite packing. 

We assume from now on that there is only the one node m in T which is 
not equal to a, or aq. If m has distance one from P we may assume without 
loss that the point of attachment of m is a3. Because m is a node there exists 
at least one endpoint a7 of T adjacent to m. If we remove all endpoints from 
T except the endpoints ao, as; and a7 we arrive at the subtree of T’ spanned 
by the vertices ao, a1, a2, A3, 4, a5,a7,m. We see from Fig. 9 that this tree has 
an equal parity bipartite packing in which every vertex splits, except for the 
vertex @2 which is not a node of T. Hence by the First Preservation Lemma, 
T has an equal parity two-packing. 

The last case is when m is a vertex in P. As before we may assume 
without loss that m = a3 holds. The node m = az as adjacent to at least 
one endpoint, say ag. The parity of m = a3 is the same as the parity of 
the node a ,. Hence the node a; is adjacent to at most one endpoint of T, 
otherwise T could be further equal parity reduced while still containing a path 
of length five. If we remove all endpoints from T except the endpoints co, a5 
and ag we arrive at the subtree of T spanned by the vertices ao, a1, a2,4a3 = 
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M,Q4,@5,ag. We see from Fig. 10 that this tree has an equal parity bipartite 
packing in which every vertex, except the vertex ag, splits. Hence by the First 
Preservation Lemma, T has an equal parity two-packing. 


Lemma 13. A tree T of diameter five has an equal parity two-packing if and 
only if T is not a crested dragon. 


Proof. If T is a crested dragon it follows from Lemma 11 that T does not 
have a bipartite two-packing. If T’ has an equal parity reducing pair a and b 
of endpoints such that T — {a, b} is not a crested dragon and has diameter at 
least five then, using the Second Preservation Lemma, we have to consider 
the tree T' — {a,b} instead. Because T is finite we can assume that for every 
equal parity reducing pair a and b of endpoints of T, T — {a,b} is either a 
crested dragon or has diameter smaller than five. By Lemma 12, the tree T 
has then an equal parity two-packing. 


Lemma 14. Every tree T of diameter larger than or equal to five which is 
not a crested dragon has an equal parity two-packing. 


Proof. By the Second Preservation Lemma we may assume that any equal 
parity reduction of T either yields a tree of diameter less than five or a crested 
dragon. Because of Lemma 13 we may also assume that the diameter of T 
is larger than five. If T’ does not contain an equal parity reducing pair of 
endpoints then because the diameter of T is larger than five, T’ would have 
to be a dragon of diameter at least seven. Because every path of length at 
least seven has a perfect packing (Lemma 6), every dragon of length at least 
seven has an equal parity two-packing. We may therefore further assume that 
the tree T contains a pair of endpoints a, b of equal parity which are adjacent 
to two different nodes. 


Case 3. The tree T contains endpoints a,b of equal parity which are adjacent 
to two different nodes such that the tree T — {a,b} is a crested dragon D. 


We assume that the path a ,,a2,a3,a4 is a derived tree of the crested 
dragon D such that a,,a4 and possibly a3 are the nodes of D. Let A be the 
set of endpoints of D adjacent to a;,B the set of endpoints of D adjacent 
to a4,C the set of endpoints of D adjacent to a3 and assume that ao is an 
endpoint in A and that as is an endpoint in B. Because the diameter of T' is 
larger than five it follows that at least one of the endpoints a or b of the equal 
parity reducing pair of endpoints of T is adjacent to one of the endpoints in 
the set AU B. 

If both vertices a and b are adjacent to vertices in the set AU B then, 
because a and b have the same parity, they are either both adjacent to vertices 
in A or they are both adjacent to vertices in B. In either case the tree R, for 
which Fig. 10 exhibits an equal parity two-packing a, is a derived tree of T’. 
The two-packing o splits all of the vertices of R except the vertex labelled 2. 
Using the First Preservation Lemma we conclude that T has an equal parity 
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two-packing except in the case where both endpoints a and 6 are adjacent 
to vertices in B and the set C' is not empty. In this case we use the tree for 
which Fig. 11 exhibits a perfect packing and which is then a derived tree of 
T. We are therefore now in the situation that exactly one of the endpoints 
a and b, say a, is in the set AU B. Assume first that a € B. Because a and 
b have the same parity the endpoint b is not adjacent to a vertex in C. The 
tree T then contains the path P = ag, a1, a2, 43, 44,a5,a¢ = a of length six 
and every endpoint of T' is adjacent to some vertex in this path. If the center 
a3 of P is not a node of T we are done by the First Preservation Lemma and 
the following Fig. 12 which exhibits a bipartite two-packing of Ps in which 
every vertex except the center splits. If a3 is a node of T we may assume 
that a3 is adjacent to ag. The vertices of the path P together with ag span a 
subtree R of T which is a derived tree of T’. If the node as of T is adjacent to 
more than one endpoint then T would have an equal parity reducing pair of 
endpoints « and y such that T — {x,y} has diameter six. Hence we assume 
without loss of generality that as is adjacent to only one endpoint. Figure 13 
exhibits an equal parity two-packing of R in which all of the nodes which 
belong to endpoints of T which are not in R are split. Hence T has an equal 
parity two-packing. 


Fig. 12 


Assume next that a is adjacent to some endpoint, say ag in A. The tree T 
contains then the path P = a, do, a1, a2, a3, a4, a5 of length six. If b is adjacent 
to an element of C then ag is not a node of T. Hence we can use the tree 
exhibited in Fig. 10 together with the First Preservation Lemma to construct 
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Fig. 13 


an equal parity two-packing of T’. If 6 is not adjacent to a vertex in C but 
to the center ag of P we use Fig. 14 and if b is neither adjacent to a2 nor to 
a vertex in C' we use Fig. 13 together with the First Preservation Lemma to 
construct an equal parity two-packing of T. 


Case 4. For any two endpoints a,b of equal parity which are adjacent to two 
different nodes, the tree T — {a,b} has diameter four. 


Let P be the path a = ao, 1, @2,43,@4,45,ag = b of length six which 
joins the endpoints a and 6 in T. Observe that T does not contain a node, 
say m, different from the nodes a, and a; which has parity 1. Otherwise some 
endpoint x adjacent to m would form an equal parity reducing pair with ao 
and T — {x,ag} would have diameter five. If T would contain two nodes, say 
m and n, besides the nodes a; and as they would both have to have even 
parity. But then T’ could be equal parity reduced by some two endpoints 
adjacent to m and n respectively and still have diameter six. If T’ contains 
no other node besides the nodes a; and as then T is a dragon of diameter six 
which has an equal parity two-packing by Fig. 12 and the First Preservation 
Lemma. 

It follows that the tree T has exactly three nodes, namely a,, a; and a 
third one, say m which has even parity. The distance from m to either one of 
a, and as is smaller than four. Otherwise we could reduce T by the pair ao, 
ag of endpoints and obtain a tree of diameter at least five. Hence m is either 
a vertex in P or has distance one from a3. If m is a vertex in P different from 
the center a3 we are done by Fig. 12 and if m is adjacent to a3 or equal to a3 
we are done by Fig. 13 and the First Preservation Lemma. 


Theorem 4. A forest has a bipartite packing if and only if it is either a 
singleton verter or disconnected or a tree of diameter at least five which is 
not a crested dragon. Every forest which has a bipartite packing has an equal 
parity packing. 


Proof. Follows immediately from Lemmas 5, 10, 13, and 14. 
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4. Two-Packings Which Yield a Graph of Chromatic 
Number Three 


Lemma 15. Every tree with diameter at least four has a three chromatic 
two-packing. 


Proof. Using the Second Preservation Lemma we may assume that if a,b is 
any reducing pair of endpoints of T then T — {a,b} does not have diameter 
at least four. This implies that T has at most three different nodes. 


Case 5. The tree T has exactly three nodes. 


Assume that the nodes are the vertices a,, bj and c, and that a is an 
endpoint adjacent to a,, b is an endpoint adjacent to b; and that c is an 
endpoint adjacent to c;. Assume also that the distance from a to b is the 
diameter of T. The diameter of T can not be larger than or equal to five 
because then the tree T — {b, c} would still have diameter at least four. Hence 
there is a path P = a,a,,d,b,,b of length four from a to b. Let the point 
of attachment of c, be the vertex in the path P which has shortest distance 
to cy. The point of attachment of c, can not be a or b because a and b are 
endpoints of T. If the point of attachment of c, is a; then c; is not a vertex 
in P, otherwise c; = a, contrary to the assumption that the three nodes ay, 
b; and c, are pairwise different. But c; has then at least distance one from 
a, and the distance from c to b would be at least five. Similarly the point 
of attachment of c; is not equal to 6;. Hence the point of attachment of c, 
must be the central point d of the path P. The distance from c, to d is at 
most one otherwise the distance from c to a would be at least five. Hence the 
distance from c; to d is either one or zero. In either case the tree R which 
consists of the path P of length four together with one endpoint x adjacent 
to the central point d will be a derived tree of the tree T. Figure 14 shows a 
two-packing of the tree R of chromatic number three in which every vertex 
splits. Using the First Preservation Lemma we conclude that T has a three 
chromatic two-packing. 


Case 6. The tree T has exactly two nodes. 


Fig. 14 


This means that T is a dragon. Assume that the nodes are the vertices a1 
and 6; and that a is an endpoint adjacent to a; and 6 is an endpoint adjacent 
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to bi. Observe that either a, or 6; is adjacent to exactly one endpoint. 
Otherwise T can be further reduced to a tree which has diameter at least 
four. For the same reason the dragon T has diameter four or five. In one 
case a path of length four and in the other case a path of length five is a 
derived tree of T. Because at least one of the two nodes is adjacent to only 
one endpoint it is sufficient to find a two-packing of a path of length four and 
a two-packing of a path of length five in which at least one of the nodes splits 
and which has chromatic number three. Such two-packings are exhibited in 
Fig. 15. 


Fig. 15 


Lemma 16. A forest F which is not connected has a three chromatic two- 
packing if and only if F contains a path of length two. 


Proof. If F does not contain a path of length two then the degree of every 
vertex of F' is at most one and F consists of a partial matching (that is a set 
of pairwise not adjacent edges), together with some isolated vertices. Hence 
for any two-packing o of F the maximal degree of FZ is at most two and 
the edges of F can be factorized into two matchings. This means that every 
connected component of F, is either a path or a circuit. Since an odd circuit 
can not be factorized into two matchings each of the circuits has even length 
which implies that the chromatic number of F, is at most two. 

Let us now assume that F’ contains a path of length two. Using the Second 
Preservation Lemma we may assume that F can not be reduced without 
yielding a forest which has no path of length two. That implies that F’ has 
at most two connected components which contain a path of length one or 
two and all other components of F are isolated points. Otherwise F’ could 
be further reduced. Observe that every isolated point has a bipartite two- 
packing. This means that we can assume that F’ has at most two connected 
components and that if F contains an isolated point then F' has exactly two 
components, one the isolated point and the other a tree of diameter two or 
three. There are three more cases to be discussed. In each case, a two-packing 
o of F is exhibited such that every vertex of F' is split and F, has chromatic 
number three. 


Case 7. The forest F has two connected components A and B which contain 
a path of length two. 
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Since the forest F' can not be further reduced, each of the two connected 
components A and B is a path of length two. Figure 16 exhibits a two-packing 
o such that F, has chromatic number three. 


Fig. 16 


Case 8. The forest F has only one component A which contains a path of 
length two and an isolated point e. 


The forest F’ consists then of two components, A and the isolated point e. 
The diameter of A is at most three, otherwise F’ could be further reduced. 
We conclude that A is either a star or a dragon of diameter three. If A is a 
dragon of diameter three we can assume that at most one of the two nodes has 
degree larger than or equal to three, otherwise F' could be further reduced. 
Figure 17 exhibits for both cases a two-packing o such that F has chromatic 
number three. 


Case 9. The forest F has only one component A which contains a path of 
length two and no isolated point. 


Then the other component of F contains exactly one edge. Since F' can 
not be reduced further, F’ must consist of a path of length two and a second 
component of diameter one. Figure 18 exhibits a two-packing o such that F, 
has chromatic number three. 


Fig. 18 
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Theorem 5. A forest F has a two-packing o such that F, has chromatic 
number three if and only if F is a tree with diameter at least four or F is 
disconnected and contains a path of length two. 


Proof. If F’ is disconnected then the theorem follows from Lemma 16. If F 
is connected and contains a path of length four the theorem follows from 
Lemma 15. If F is connected and has diameter zero, that is F’ is an isolated 
point, then for every two-packing o of F’, the graph F, has chromatic number 
one. If F is connected and has diameter one or diameter two, then F’ does 
not have a two-packing, by Lemma 3. If F' is connected and has diameter 
three then it follows from Lemma 4 that for every two packing o of F’, the 
graph F, has chromatic number four. 


5. Two-Packings Which Yield a Graph of Chromatic 
Number Four 


For n > 3, the wheel W,, is a graph which consists of a cycle with n vertices 
and an additional vertex which is adjacent to all of the vertices in the cycle. 
The complete graph K, for example is the wheel W3. 


Lemma 17. A graph G with at most six vertices and chromatic number four 
contains the complete graph K4 or the wheel Ws as a subgraph. 


Proof. We only need consider the cases that G has five vertices and that G 
has six vertices. If G has six vertices and does not contain a triangle, then 
because G has chromatic number four it must contain a circuit C of length 
five. Because G does not contain a triangle C has no chords. If the sixth 
vertex is not adjacent to all of the vertices of C, G would have chromatic 
number three and if the sixth vertex is adjacent to all vertices of C then 
G is the wheel W;. If G contains a triangle with vertices a, b, c then color 
the vertex a with color 0 the vertex 6 with color 1 and the vertex c with 
color 2. If G does not contain the complete graph K4, then each of the other 
three vertices x, y, z is adjacent to at most two of the vertices in the triangle 
spanned by a, }, c. If two of the vertices x, y and z are adjacent then they are 
not adjacent to the same two vertices of the triangle spanned by a, b, c. We 
color the vertices x, y and z with the colors 0, 1 and 2 in such a way that none 
of the vertices x, y, z is adjacent to a vertex in the set {a,b,c} of vertices 
which has the same color. This can be done because each of the vertices x, 
y, z is adjacent to at most two of the vertices in the triangle spanned by 
a, b, c. We also assume that the number of adjacent pairs of vertices which 
received the same color is as small as possible under the coloring condition 
given above. Observe that if all three colors are used to color the vertices x, 
y and z then G would have a good three coloring. Hence we can assume that 
at least two of the three vertices x, y and z have the same color. 
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Assume first that all three vertices x, y and z have the same color, say 2. 
If no two of them are adjacent then we arrived at a three coloring of the 
graph G. Hence assume that the vertices « and y are adjacent. At least one 
of the two vertices 7 and y, say y is not adjacent to one of the two vertices, 
say b. We can then color y with the color 1 instead of with the color 2 and 
obtain a coloring in which there are fewer adjacent pairs of vertices which 
have the same color. 

Assume next that the vertices x and y have the same color, say 2, and that 
the vertex z has a different color, say 1. The vertices x and y are adjacent, 
otherwise the given coloring would be a good three coloring of the graph G. 
If one of the two vertices x and y, say y, is not adjacent to the vertex a, 
then y could be colored with the color 0 to obtain a good three coloring of 
G. Hence both vertices x and y are adjacent to the vertex a. This means 
that one of the two vertices x and y, say y is not adjacent to the vertex b. 
If the vertices z and y would not be adjacent then coloring y with the color 
1 instead of 2 would give a good three coloring of G. Hence y is adjacent 
to z. If the vertex z would not be adjacent to the vertex a we would obtain 
a good three coloring of G by changing the color or z from 1 to 0 and the 
color of y from 2 to 1. Hence z is adjacent to a. The vertices z and x are not 
adjacent otherwise the vertices x, y, z and a would form a Ky. This implies 
that x is adjacent to b otherwise x could be colored with color 1 to give a 
good three coloring of G. Also the vertices z and c are adjacent, otherwise 
we could change the color of z from 1 to 2 and the color of y from 2 to 1 to 
obtain a good three coloring of G. This now finally implies that G contains 
the wheel Ws; because x, y, z,c, b,x form a cycle of length five and the vertex 
a is adjacent to the other five vertices. 

If G has five vertices let G, be the graph obtained from G by adding an 
additional isolated point. We can then apply the result for six vertices to G 
and obtain the lemma in the case of five vertices as well. 


Lemma 18. /f A and B is a factorization of the complete graph K4 into 
two forests, then both of the factors A and B are isomorphic to the path P3 
of length three. 


Proof. If one of the factors A or B contains less than three edges the other 
factor must contain at least four edges and hence a circuit. This means that 
both of the factors A and B contain exactly three edges. Hence each of the 
factors A and B is either the path P3 or the star $3. The complement of S3 in 
K,4 is a triangle. Hence both factors A and B are isomorphic to the path P3. 


Lemma 19. A forest F with exactly two connected components P and S, 
where P is a path of length three and S is a star S, with n > 1, does not 
have a two-packing a such that F, has chromatic number four. 


Proof. Assume for a contradiction that Fj has chromatic number four. Let 
G be the graph obtained from F, by removing all vertices from F, which 
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have degree at most two. Clearly, fF, has chromatic number four if and only 
if G has chromatic number four. Because F’ contains at most three vertices 
which have degree larger than one, the graph G contains at most six vertices. 
Hence by Lemma 17, G and therefore F,, contains the complete graph Ky, or 
the wheel W;. According to Lemma 18, if A, B is a two-factorization of K4 
into two forests then both factors A and B are isomorphic to P;. Because 
F contains only one path of length three, the two factors A and B are a 
two-packing of P3. But this is not possible because the star S does not have 
a two-packing. 

Let us now assume that G contains the wheel W;. The wheel W; contains 
six vertices each of which has degree at least three. This means that the 
vertices of this wheel consist of the three vertices of F' which are not endpoints 
of F together with their o images. Notice that if y is a vertex in Ws which is 
not an endpoint of F then both o~!(y) and o(y) is an endpoint of F’. Let a 
be the center of the star S. The vertex x of W5 which is adjacent to all other 
vertices of Ws has degree five. We may assume without loss of generality that 
three of the edges adjacent in G to x are edges of F' and hence that x = a. 
The two central points c, d of P are then in Ws adjacent to 2 = a. The two 
edges of Ws from c, d to x = a must be edges of o(F). But then a~!(a) could 
not be an endpoint of F’. 


Lemma 20. A forest F which does not contain a path of length three does 
not have a two-packing o such that Fy has chromatic number four. 


Proof. Let us direct the edges of F' in such a way that every directed edge 
points towards an endpoint of F’. Then we direct the edges of o(F’) such that 
o preserves the directions. This makes F, into a directed graph. Observe 
that a vertex in F, has degree larger than or equal to three if and only if its 
indegree is at most one. If F, has chromatic number four then the graphs 
which we obtain from F, by successively removing vertices of degree at most 
two will also have chromatic number four. Hence this process of removing 
vertices of degree at most two ends in a graph G of chromatic number 
four in which every vertex has degree at least three. This is a contradiction 
because in G' every vertex has indegree at most one and hence outdegree at 
least two. 


Lemma 21. Every tree T of diameter at least three has a two packing o 
such that F, has chromatic number four. 


Proof. By the Second Preservation Lemma we may assume that T is reduced, 
that means that whenever a, b is a reducing pair of endpoints then T — {a, b} 
has diameter smaller than three. This implies that TJ’ has diameter either three 
or four and if T has diameter four then T' has exactly two endpoints, that is 
T is a path of length four, and if T has diameter three then T is a dragon of 
diameter three because every tree of diameter three is a dragon. Observe that 
in both cases the path of length three is a derived tree. Any factorization of 
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the complete graph Ky into two paths of length three results in a two packing 
o of the path of length three, in which every vertex splits. 


Lemma 22. If F is a forest which contains at least one path of length three 
and if F does not have exactly two components one of which is a star and 
the other a path of length three then F has a two-packing a such that F, has 
chromatic number four. 


Proof. By the Second Preservation Lemma we may assume that F' is reduced, 
that is if a and 6 is a reducing pair of vertices then F' — {a, b} is a forest which 
either does not contain a path of length three or F — {a,b} has exactly two 
components one of which is a star and the other a path of length three. If 
F has more than two components A, By, Bo,..., Bn with n > 2, where A 
contains a path of length three then let o be a two-packing of A such that Fy 
has chromatic number four, (Lemma 21), and A a two-packing of the forest 
which consists of the other components. Clearly g U A has then the required 
properties. We may therefore assume that F’ has exactly two components A 
and B where A has diameter at least three. If B is not a star then again the 
union of a four chromatic two-packing of A and a two-packing of B will be 
a four chromatic two-packing of F’. We assume therefore that B is a star. If 
A has diameter larger than or equal to five then F could be reduced further 
by a pair of endpoints, one from B and the other from A. If A has diameter 
four then A has exactly two endpoints or else we could reduce F' by a pair of 
endpoints one from A and the other from B. We arrived then at the case where 
A is a path of length four and B a star with at least two vertices. Figure 19 
exhibits a two packing o for this case such that F, has chromatic number 
four. If A has diameter three then A is a dragon of diameter three. If A would 
have more than three endpoints then F' could be further reduced. Hence A 
has exactly three endpoints, otherwise A would be a path of length three. 
Figure 20 exhibits a two packing such that Ff has chromatic number four. 


Fig. 19 


Theorem 6. A forest F has a two-packing a such that F, has chromatic 
number four if and only if F contains a path of length three and F does not 
consist of exactly two connected components where one of the two components 
is a path of length three and the other is a star. 


166 Hong Wang and Norbert Sauer 


Fig. 20 


Proof. Theorem 6 follows immediately from Lemmas 19, 20, 21, and 22. 
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II. Ramsey and Extremal Theory 


Introduction 


It is perhaps evident from several places of this volume that Ramsey theorem 
played a decisive role in Erdés’ combinatorial activity. And perhaps no other 
part of combinatorial mathematics is so dear to him as Ramsey theory 
and extremal problems. He was not creating or even aiming for a theory. 
However, a complex web of results and conjectures did, in fact, give rise 
to several theories. They all started with modest short papers by Erdés, 
Szekeres and Turan in the thirties. How striking it is to compare these initial 
papers with the richness of the later development, described, e.g., by the 
survey articles of Miki Simonovits (extremal graph theory) and Jeff Kahn 
(extremal set theory). In addition, the editors of these volumes tried to cover 
in greater detail the development of Ramsey theory mirrored and motivated 
by Erdés’ papers. In a way (and this certainly is one of the leitmotivs of 
Erdés’ work), there is little difference between, say, density Ramsey type 
results and extremal problems. 

One can only speculate on the origins of density questions. It is clear 
that in the late 1930s and 1940s, the time was ripe in ideas which later 
developed into extremal and density questions: we have not only the Erdés- 
Turan 1941 paper but also, Erdés and Tomsk 1938 paper on number theory 
which anticipated extremal theory by determining n?/? as upper bound for 
C4-free graphs, the Sperner paper and also the Erdés—Ko—Rado work (which 
took several decades to get into print). All these ideas, together with Turan’s 
extremal results provided a fruitful cross-interaction of ideas from various 
fields which, some 30 years later, developed into density Ramsey theorems 
and extremal theory. We are happy to include in this chapter papers by 
Gyula Katona (which gives a rare account of Erdés method of encouraging 
and educating young talented students). The article by Vojtéch Rédl and 
Robin Thomas related to another aspect of Erdés’ work was the start of many 
further refinements of arrangeability for special classes of sparse graphs (see J. 
NeSetril, P. Ossona de Mendez: Sparsity, Springer 2012 for a recent discussion 
of the problem). We also include an important paper by Alexander Kostochka 
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which gives a major breakthrough to the Erdés-Rado A-system problem 
(for triples; meanwhile Kostochka succeeded in generalizing the result to k- 
tuples). 

Finally, we have the paper of Saharon Shelah, solving a model theoretic 
Ramsey question of Vaananen, illustrating (once again) that Ramsey theory 
is alive and well. 

In 1995/1996, when the content of these volumes was already crystallising, 
we asked Paul Erdés to isolate a few problems, both recent and old, for each 
of the eight main parts of this book. To this part on Ramsey theory he 
contributed the following collection of problems and comments. 


Erdos in his own words 
Hajnal, Rado and I proved 
ack” < rs(k,k) <2? , 


we believe that the upper bound is correct or at least is closer to the truth, 
but Hajnal and I have a curious result: If one colors the triples of a set of 
n elements by two colors, there always is a set of size (log n)!/? where the 
distribution is not just, i.e., one of the colors has more than 0.6(3) of the 
triples for t = (logn)!/?. Nevertheless we believe that the upper bound is 
correct. It would perhaps change our minds if we could replace 0.6 by 1 — € 
for some t, t > (logn)*. We never had any method of doing this. 

Let H be a fixed graph and let n be large. Hajnal and I conjectured that 
if G(n) does not contain an induced copy of H then G(n) must contain a 
trivial subgraph of size > n°, « = e(H). We proved this for many special 
cases but many problems remain. We could only prove exp(c,/log n). 


Extremal Graph Theory 


Denote by T(n;G) the smallest integer for which every graph of n vertices 
and T(n;G) edges contains G as a subgraph. Turdn determined T(n; G) if 
G is K(r) for all r, T(n,Cy) = ($ + o(1))n3/? was proved by V. T. Sos, 
Rényi and myself. The exact formula for T(n,C,) is known if n = p?+p+1 
(Fiiredi). 

I have many asymptotic and exact results for T(n, C4) and many results 
and conjectures with Simonovits. I have to refer to the excellent book of 
Bollobas and the excellent survey of Simonovits and a very good recent survey 
article of Firedi. Here I only state two results of Simonovits and myself: 
T(n,G) < en8/> where G is the edge graph of the three-dimensional cube. 
Also we have fairly exact results for T'(n, K (2, 2,2)). 


Ramsey—Turan Theorems 


The first papers are joint with V. T. Sds and then there are comprehensive 
papers with Hajnal, Simonovits, V. T. Sés and Szemerédi. Here I only state 
a result with Bollobas, Szemerédi and myself: For every e > 0 and n > no(e) 
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n> 


there is a graph of n vertices {-(1—) edges with no K(4) and having largest 
independent set o(n), but such a graph does not exist if the number of edges 
is m1 +). We have no idea what happens if the number of edges is cae 
One last Ramsey type problem: Let n, be the smallest integer (if it exists) 
for which if we color the proper divisors of nz, by k colors then nz will be a 
monochromatic sum of distinct divisors, namely a sum of distinct divisors in 
a color class. I am sure that nz exists for every & but I think it is not even 
known if n» exists. It would be of some interest to determine at least no. 
An old problem of R. L. Graham and myself states: Is it true that if m, is 
sufficiently large and we color the integers 2 < t < mx by k colors, then 


1 
l= — 
ae 
is always solvable monochromatically? I would like to see a proof that m2 
exists. (Clearly m, > ng.) Perhaps this is really a Turdn type problem and 
not a Ramsey problem. In other words, if m is sufficiently large and 1 < a, < 
ag <+++ < ag < mis a sequence of integers for which )°,, /ae > dlogm, then 
ej 
L= — (eé;=O0orl 
Te |=oorn 
is always solvable. I offer 100 dollars for a proof or disproof. Perhaps it suffices 
to assume that 


S- - > C(log log m)? 
ay<m—* 
for some large enough C. For further problems of this kind as well as for 
related results see my book with R. L. Graham. I hope before the year 2000 
a second edition will appear. 
Pree 

So much for Paul Erdés. Of course progress since 1995/1996 has been very 
rapid, particularly so in extremal and Ramsey theory. Most of the papers were 
updated for this second edition. Note that the main result of the paper by 
Shelah in this chapter was improved very recently by D. Conlon, J. Fox and 
B. Sudakov (Two extensions of Ramsey theorem, to appear in Duke Math. 
J.): the order of the modified Ramsey function is single exponential. 

We are fortunate that Sasha Razborov contributed to our volume with a 
new survey on an exciting new tool for extremal theory problems—the flag 
algebra calculus. (See also the recent book L. Lovadsz: Graph Limits, AMS 
2012.) 

At this place let us comment on the last part of Erdés’ text: The problem 
of Erdés and Graham has been solved by Ernie Croot (then only a student) 
in 
E. Croot, On a coloring conjecture about unit fractions, Ann. Math. 157 
(2003), 545-556. 
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On the other hand the second edition of 
P. Erdés, R. L. Graham, Old and new problems and results in combinatorial 
number theory, Monographie L’Enseignements Math. 28,(1980) definitively 
did not appear before 2000 (in fact it has not even appeared up to now 
(2013)). Well, even Paul Erdés was sometimes wrong! 
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Summary. Ramsey’s theorem was not discovered by P. Erdés. But perhaps one 
could say that Ramsey theory was created largely by him. This paper will attempt 
to demonstrate this claim. 


1. Introduction 


Ramsey’s theorem was not discovered by Paul Erdés. This was barely 
technically possible: Ramsey proved his theorem in 1928 (or 1930, depending 
on the quoted source) and this is well before the earliest Erdés publication in 
1932. He was then 19. At such an early age 4 years makes a big difference. And 
also at this time Erdds was not even predominantly active in combinatorics. 
The absolute majority of the earliest publications of Erdés is devoted to 
number theory, as can be seen from the following table: 


1932 | 1933 | 1934 | 1935 | 1936 | 1937 | 1938 | 1939 


All papers a 0 5 10 {11 |10 |13 |13 


Number theory | 2 | 0 5 | 9 10 | 10 12 | 13 


The three combinatorial exceptions among his first 8? papers published 
in 8 years are 2 papers on infinite Eulerian graphs and the paper [46] by 
Erdos and G. Szekeres. Thus, the very young P. Erddés could not have been a 
driving force for the development of Ramsey theory or Ramsey-type theorems 
in the 30s. That position should be perhaps reserved for Issac Schur who not 
only proved his sum theorem [114] in 1916 but, as it appears now [115], 
also conjectured van der Waerden’s theorem [124], proved an important 
extension, and thus put it into a context which inspired his student R. Rado 
to completely settle (in 1933) the question of monochromatic solutions of 
linear equations [102]. This result stands apart even after 60 years. 

Yet, in retrospect, it is fair to say that P. Erdés was responsible for the 
continuously growing popularity of the field. Ever since his pioneering work in 
the 30s he proved, conjectured and asked seminal questions which together, 
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some 40-50 years later, formed Ramsey theory. And for Erdés, Ramsey theory 
was a constant source of problems which motivated some of the key pieces of 
his combinatorial research. 

It is the purpose of this article to partially justify these claims, using a 
few examples of Erdos’ activity in Ramsey theory which we will discuss from 
a contemporary point of view. 

In the first section we cover paper [46] and later development in great 
detail. In Sect. 2, we consider the development based on Erdés’ work related 
to bounds on various Ramsey functions. Finally, in Sect.3 we consider his 
work related to structural extensions of Ramsey’s theorem. 

No mention will be made of his work on infinite extensions of Ramsey’s 
theorem. This is covered in this volume by the comprehensive paper of A. 
Hajnal. 


2. The Erd6s-Szekeres Theorem 


F. P. Ramsey discovered his theorem [104] in a sound mathematical context 
(of the decision problem for a class of first-order formulas; at the time, 
the undecidability of the problem was not known). But since the time of 
Dirichlet the “Schubfach principle” and its extensions and variations played 
a distinguished role in mathematics. The same holds for the other early 
contributions of Hilbert [67], Schur [114] and van der Waerden [124]. 

Perhaps because of this context Ramsey’s theorem was never regarded as 
a puzzle and/or a combinatorial curiosity only. Thanks to Erdés and Szekeres 
[46] the theorem found an early application in a quite different context, 
namely, plane geometry: 


Theorem 1 ([46]). Let n be a positive integer. Then there exists a least 
integer N(n) with the following property: If X is a set of N(n) points in the 
plane in general position (i.e. no three of which are collinear) then X contains 
an n-tuple which forms the vertices of a convex n-gon. 


One should note that (like in Ramsey’s original application in logic) this 
statement does not involve any coloring (or partition) and thus, by itself, fails 
to be of “Ramsey type”. Rather it fits to a more philosophical description of 
Ramsey type statements as formulated by Mirsky: 


There are numerous theorems in mathematics which assert, crudely 
speaking, that every system of a certain class possesses a large 
subsystem with a higher degree of organization than the original 
system. 


It is perhaps noteworthy to list the main features of the paper. What a 
wealth of ideas it contains! We can list at least 6 main aspects of this paper 
(numbered I-VI): 
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I. 


II. 


III. 


IV. 


It is proved that N(4) = 5 and this is attributed to Mrs. E. Klein. 
This is tied to the social and intellectual climate in Budapest in the 30s 
which has been described both by Paul Erdds and Szekeres on several 
occasions (see e.g. [30]), and with names like the Happy End Theorem. 
The following two questions related to statement of Theorem 1 are 
explicitly formulated: 


(a) Does the number N(n) exist for every n? 
(b) If so, estimate the value of N(n). 


It is clear that the estimates were considered by Erdés from the very 
beginning. This is evident at several places in the article. 

The first proof proves the existence of N(n) by applying Ramsey’s 
theorem for partitions of quadruples. It is proved that N(n) < 
r(2,4, {5,n}). This is still a textbook argument. Another proof based 
on Ramsey’s theorem for partitions of triples was found by A. Tarsi (see 
[(63]). So far no proof has emerged which is based on the graph Ramsey 
theorem only. 

The authors give “a new proof of Ramsey’s theorem which differs entirely 
from the previous ones and gives for m;(k,¢) slightly smaller limits”. 
Here m,(k, £) denotes the minimal value of |X| such that every partition 
of z-element subsets of X into two classes, say a and 3, each k-element 
contains an i-element subset of class a or each i-element subset contains 
an i-element subset of class £. 

Thus, m;(k,) is the Ramsey number for 2-partitions of i-element 
subsets. These numbers are denoted today by r(2,7, {k,1}) or ri(k, 1). 
The proof is close to the standard textbook proofs of Ramsey’s theorem. 
Several times P. Erdés attributed it to G. Szekeres. 

Erdés and Szekeres explicitly state that re(k + 1,@+ 1) = mo(k+ 
1lé+1j)< Ca) and this value remained for 50 years essentially the 
best available upper bound for graph Ramsey numbers until the recent 
improvements by Rédl, Thomason [122] and finally by Conlon [17]. 
The current best upper bound (for k = £) is 


2k log k 
k7C Weloek . 
( ‘) 


. It is not as well known that [46] contains yet another proof of the graph 


theoretic formulation of Ramsey’s theorem (in the above notation, i = 2) 
which is stated for its particular simplicity. We reproduce its formulation 
here. 


Theorem 2. In an arbitrary graph let the maximum number of 
independent points be k; if the number of points is N 2 m(k,£) then 
there exists in our graph a complete graph of order &. 
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Proof. For = 1,2, the theorem is trivial for any k, since the maximum 
number of independent points is k and if the number of points is (k+1), 
there must be an edge (complete graph of order 2). 

Now suppose the theorem proved for (¢— 1) with any k. Then at 
t A-k edges start from one of the independent points. Hence if 


N-k 
k 


leas 


2 m(k,£—1), 
i.e., 
NZk-m(k,€—1) +k, 


then, out of the end points of these edges we may select, in virtue of our 
induction hypothesis, a complete graph whose order is at least (¢ — 1). 
As the points of this graph are connected with the same point, they 
form together a complete graph of order @. 


This indicates that Erdés and Szekeres were well aware of the novelty 
of the approach to Ramsey’s theorem. Also this is the formulation of 
Ramsey’s problem which motivated some of the key pieces of Erdés’ 
research. First an early use of the averaging argument and then the 
formulation of Ramsey’s theorem in a “high off-diagonal” form: If a 
graph G has a bounded clique number (for example, if it is triangle- 
free) then its independence number has to be large. The study of this 
phenomenon led Erdés to key papers [25, 27, 28] which will be discussed 
in the next section in greater detail. 

VI. The paper [46] contains a second proof of Theorem 1. This is a more 
geometrical proof which yields a better bound 


N(n)< 3) ai 


and it is conjectured (based on the exact values of N(n) for n = 3, 4,5) 
that N(n) = 2”~? + 1. This is still an unsolved problem. The second 
proof (which 50 years later very nicely fits to a computational geometry 
context) is based on yet another Ramsey-type result. 


Theorem 3 (Ordered pigeon hole principle; Monotonicity lemma). 
Let m, n be positive integers. Then every set of (m—1)(n—1)+4+1 distinct 
integers contains either a monotone increasing n-set or monotone decreasing 
m-set. 


The authors of [46] note that the same problem was considered by R. 
Rado. The stage has been set. 

The ordered pigeon-hole principle has been generalized in many different 
directions (see e.g., [14, 90] and more recently [10]). 

All this is contained in this truly seminal paper. Viewed from a contem- 
porary perspective, the Erdés-Szekeres paper did not solve any well-known 
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problem at the time and did not contribute to Erdés’ instant mathematical 
fame (as a number theorist). But the importance of the paper [46] for the 
later development of combinatorial mathematics cannot be overestimated. 
To illustrate this development is one of the aims of this paper. 

Apart from the problem of a good estimation of the value of N there is a 
peculiar structural problem related to [46]: 

Call a set Y C X an n-hole in X if Y is the set of vertices of a convex 
n-gon which does not contain other points in X. Does there always exist 
N*(n) such that if X is any set of at least N*(n) points in the plane in 
general position then X contains an n-hole? 

It is easy to prove that N*(n) exists for n < 5 (see Harborth (1978) where 
these numbers are determined). Horton (1983) showed that N*(7) does not 
exist. The fact that N*(6) exists was established only very recently. The best 
bounds currently available are 30 < N*(6) < 463 (see [57, 96, 73, 97]). 


3. Estimating Ramsey Numbers 


Today it seems that the first question in this area which one might be tempted 
to consider is the problem of determining the actual sizes of the sets which 
are guaranteed by Ramsey’s theorem (and other Ramsey-type theorems). But 
one should try to resist this temptation since it is “well-known” that Ramsey 
numbers (of all sorts) are difficult to determine and even good asymptotic 
estimates are difficult to find. 

It seems that these difficulties were known to both Erdés and Ramsey. 
But Erdés considered them very challenging and addressed this question in 
several of his key articles. In many cases his estimations obtained decades ago 
are still the best available. Not only that, his innovative techniques became 
standard and whole theories evolved from his key papers. 

Here is a side comment which may partly explain this success: Erdés 
was certainly one of the first number theorists who took an interest in 
combinatorics in the contemporary sense (being preceded by isolated events, 
for example, by V. Jarnik’s work on the minimum spanning tree problem and 
the Steiner problem see [69] and e.g. [66] and more recent [89] for the history 
of the problem. Incidentally, Jarnfk was one of the first coauthors of Erdés.) 
Together with Turan, Erdés brought to the “slums of topology” not only his 
brilliance but also his expertise and “good taste”. It is our opinion that these 
facts profoundly influenced further development of the whole field. Thus it 
is not perhaps surprising that if one would isolate a single feature of Erdés’ 
contribution to Ramsey theory then it is perhaps his continuing emphasis on 
estimates of various Ramsey-related questions. From the large number of his 
results and papers we decided to cover several key articles and comment on 
them from a contemporary point of view. 
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I. The 1947 paper [25]. In a classically clear way, Erdés proved 


Qk? < r(k) < 4" (1) 
for every k > 3. 


His proof became one of the standard textbook examples of the power 
of the probabilistic method. (Another example perhaps being the strikingly 
simple proof of Shannon of the existence of exponentially complex Boolean 
functions.) 

The paper [25] proceeds by stating (1) in an inverse form: Define A(n) as 
the greatest integer such that given any graph G of n vertices, either it or its 
complementary graph contains a complete subgraph of order A(n). Then for 
A(n) > 3, 


logn 2logn 


A 
2 log 2 (n) < 


Despite considerable efforts over many years, these bounds have been 
improved only slightly (see [121, 117]). We commented on the upper bound 
improvements above. The best current lower bound is 


log2 © 


r(n) > (1+ o(1y) ean? 
which is twice the Erdés bound (when computed from his proof). 

The paper [25] was one of 23 papers which Erdés published within 3 years 
in the Bull. Amer. Math. Soc. and already here it is mentioned that although 
the upper bound for r(3,n) is quadratic, the present proof does not yield a 
nonlinear lower bound. That had to wait for another 10 years. 


II. The 1958 paper [27]|—Graph theory and probability. The main 
result of this paper deals with graphs, circuits, and chromatic number 
and as such does not seem to have much to do with Ramsey theory. Yet 
the paper starts with the review of bounds for r(k,k) and r(3,k) (all 
due to Erdés and Szekeres). Ramsey numbers are denoted as in most 
older Erdés papers by symbols of f(k), f(3,k), g(k). He then defines 
analogously the function h(k, @) as “the least integer so that every graph 
of h(k, 2) vertices contains either a closed circuit of & or fewer lines or the 
graph contains a set of independent points. Clearly h(3, 2) = f(3, 2)”. 


The main result of [27] is that h(k,@) > €'+1/?* for any fixed k > 3 and 
£ sufficiently large. The proof is one of the most striking early uses of the 
probabilistic method. Erdés was probably aware of it and this may explain 
(and justify) the title of the paper. It is also proved that h(2k+1,£) < cé!+1/* 
and this is proved by a variant of the greedy algorithm by induction on £. 
Now after this is claimed, it is remarked that the above estimation (1) leads 
to the fact that there exists a graph G with n vertices which contain no closed 
circuit of fewer than k edges and such that its chromatic number is > n°. 
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This side remark is in fact perhaps the most well known formulation of 
the main result of [27]: 


Theorem 4. For every choice of positive integers k,t and € there exists a 
k-graph G with the following properties: 


(1) The chromatic number of G > t. 
(2) The girth of G > &. 


This is one of the few true combinatorial classics. It started in the 40s 
with Tutte [20] and Zykov [126] for the case k = 2 and ¢ = 2 (i.e., for triangle- 
free graphs). Later, this particular case was rediscovered and also conjectured 
several times [22, 70]. Kelly and Kelly [70] proved the case k = 2, ¢ <5, and 
conjectured the general statement for graphs. This was settled by Erdos in 
[27] and the same probabilistic method has been applied by Erdés and Hajnal 
[35] to yield the general result for hypergraphs. 

Erdés and Rado [41] proved the extension of k = 2, ¢ = 2 to transfinite 
chromatic numbers while Erdés and Hajnal [36] gave a particularly simple 
construction of triangle-free graphs, so called shift graphs G = (V,E):V = 
{(i,g;1<i<g <n} and E = {(i,7),(@,97)31 <7 =i < jg}. Gn is triangle- 
free and x(Gp,) = [log n]. 

For many reasons it is desirable to have a constructive proof of Theorem 4. 
This has been stressed by Erdés on many occasions (and already in [27]). 
This appeared to be a difficult problem and a construction in full generality 
was finally given by Lovasz [81]. A simplified construction has been found 
in the context of Ramsey theory by NeSetiil and Rédl [91]. The graphs and 
hypergraphs with the above properties (i), (ii) are called highly chromatic 
(locally) sparse graphs, for short. Their existence could be regarded as one of 
the true paradoxes of finite set theory (see [35]) and it has always been felt 
that this result is one of the central results in combinatorics. 

Recently it has been realized that sparse and complex graphs may be used 
in theoretical computer science for the design of fast algorithms. However, 
what is needed there is not only a construction of these “paradoxical” 
structures but also their reasonable size. In one of the most striking recent 
developments, a program for constructing complex sparse graphs has been 
successfully carried out. Using several highly ingenious constructions which 
combine algebraic and topological methods it has been shown that there are 
complex sparse graphs, the size of which in several instances improves the 
size of random objects. See Margulis [84], Alon [2] and Lubotzky et al. [83]. 

Particularly, it follows from Lubotzky et al. [83] that there are examples 
of graphs with girth 2, chromatic number ¢ and the size at most t®’. A bit 
surprisingly, the following is still open: 

Find a primitive recursive construction of highly chromatic locally sparse 
k-uniform hypergraphs. Indeed, even triple systems (i.e., k = 3) present a 
problem. The best construction seems to be given in [75]. 
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III. r(3, 7) [28]. The paper [28] provides the lower bound estimate on the 
Ramsey number r(3, 7). Using probabilistic methods Erdés proved 
2 
r(3,n) > a (2) 
log* n 
(while the upper bound r(3,n) < ("3") follows from [46]). The estima- 
tion of Ramsey numbers r(3,n) was Erddés’ favorite problem for many 
years. We find it already in his 1947 paper [25] where he mentioned 
that he cannot prove the nonlinearity of r(3,n). Later he stressed this 
problem (of estimating r(3,n)) on many occasions and conjectured 
various forms of it. He certainly felt the importance of this special case. 
How right he was is clear from the later developments, which read as a 
saga of modern combinatorics. And as isolated as this may seems, the 
problem of estimating 7(3,n) became a cradle for many methods and 
results, far exceeding the original motivation. 


In 1981 Ajtai, Komlds and Szemerédi in their important paper [1] proved 
by a novel method 
2 


r(3,n) < (3) 
This bound and their method of proof has found many applications. 
The Ajtai, Komlés and Szemerédi proof was motivated by yet another Erdés 
problem from combinatorial number theory. In 1941 Erdés and Turan [48] 
considered problem of dense Sidon sequences (or Bz-sequences). An infinite 
sequence S = {a, < a2 <---} of natural numbers is called Sidon sequence if 
all pairwise sums a; + a; are distinct. Define 


“Tog n’ 


fs(n) = max{x : az <n} 


and for a given n, let f(n) denote the maximal possible value of f,(n). In [48], 
Erdés and Turan prove that for finite Sidon sequences f(n) ~ n!/? (improving 
Sidon’s bound of n!/4; Sidon’s motivation came from Fourier analysis [116]). 
However for every infinite Sidon sequence S' growth of f,(n) is a more difficult 


problem and as noted by Erdos and Turan, 
lim inf f,(n)/n1/? = 0. 


By using a greedy argument it was shown by Erdés [26] that f,(n) > n!/°. 
(Indeed, given k numbers 2, <... < x, up to n, each triple x; < x; < rp 
kills at most 3 other numbers x, 7 +2; = p+ 2, % +2, = 2; +2 and 
Lj; +a, = 2+ 2 and thus if k + 3(5) < ck? <n we can always find a 
number « < n which can be added to S.) Ajtai, Komlés and Szemerédi 
proved [1] using a novel “random construction” the existence of an infinite 
Sidon sequence S' such that 


fs(n) > c- (nlogn)¥/3, 
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An analysis of independent sets in triangle-free graphs is the basis of their 
approach and this yields as a corollary the above mentioned upper bound on 
r(3,n). (The best upper bound for f,(n) is of order c- (nlogn)!/?.) 

It should be noted that the above Erdés-Turdn paper [48] contains the 
following still unsolved problem: Let a, < ag <--: be an arbitrary sequence. 
Denote by f(n) the number of representations of n as a; + a;. Erdés and 
Turan prove that f(n) cannot be a constant for all sufficiently large n and 
conjectured that if f(m) > 0 for all sufficiently large n then lim sup f(n) = oo. 
This is still open. Erdés provided a multiplicative analogue of this conjecture 
(i.e., for the function g(n), the number of representation of n as a;a;); this is 
noted already in [48]. One can ask what this has to do with Ramsey theory. 
Well, not only was this the motivation for [1] but a simple proof of the fact 
that limsup g(n) = oo was given by NeSetril and Rodl in [93] just using 
Ramsey’s theorem. 

We started this paper by listing the predominance of Erdoés’s first works 
in number theory. But in a way this is misleading since the early papers of 
Erdés stressed elementary methods and often used combinatorial or graph- 
theoretical methods. The Erdés-Turan paper [48] is such an example and the 
paper [24] even more so. 

The innovative Ajtai-Komlés-Szemerédi paper [1] was the basis for a 
further development (see, e.g., [6]) and this in turn led somewhat surprisingly 
to the remarkable solution of Kim [72], who proved that the Ajtai-Komldés- 
Szemerédi bound is up to a constant factor, the best possible, i.e., 

2 
r(n,3) >c¢ " 


logn’ 


Thus r(n, 3) is a nontrivial infinite family of (classical) Ramsey numbers with 
known asymptotics. Recently, there are more such examples, see [3, 4, 5]. 


IV. Constructions. It was realized early by Erdés the importance of finding 
explicit constructions of various combinatorial objects whose existence 
he justified by probabilistic methods (e.g., by counting). In most cases 
such constructions have not yet been found but even constructions 
producing weaker results (or bounds) formed an important line of 
research. For example, the search for an explicit graph of size (say) 
2"/2 which would demonstrate this Ramsey lower bound has been so 
far unsuccessful. This is not an entirely satisfactory situation since it is 
believed that such graphs share many properties with random graphs 
and thus they could be good candidates for various lower bounds, for 
example, in theoretical computer science for lower bounds for various 
measures of complexity. (See the papers [13] and [122] which discuss 
properties of pseudo- and quasirandom graphs.) 


The best constructive lower bound for Ramsey numbers r(n) is due to 
Frankl and Wilson. This improves on an earlier construction of Frankl [51] 
who found the first constructive superpolynomial lower bound. 
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The construction of Frankl-Wilson graphs is simple: 
Let p be a prime number, and put g = p®. Define the graph G, = (V, E) 
as follows: 


v= es) =41FC (euge pa Fle — Oh, 
{F,F}e€E iff |FNF|=-1 (mod q). 


The graph G, has (ea vertices. However, the Ramsey properties of the 
graph G, are not trivial to prove: It follows only from deep extremal 
set theory results due to Frankl and Wilson [53] that neither G, nor its 
complement contain K,, for n > (ee ve This construction itself was motivated 
by several extremal problems of Erdés and in a way (again!) the Frankl- 
Wilson construction was a byproduct of these efforts. 

We already mentioned earlier the developments related to Erdés paper 
[27]. The constructive version of bounds for r(3,) led Erdés to geometrically 
defined graphs. An early example is Erdés-Rogers paper [45] where they prove 
that there exists a graph G with ¢!+° vertices, which contains no complete 
k-gon, but such that each subgraph with ¢ vertices contains a complete 
(k — 1)-gon. 

If we denote by h(k,@) the minimum integer such that every graph of 
h(k, 2) vertices contains either a complete graph of k vertices or a set of £ 
points not containing a complete graph with k — 1 vertices, then 


h(k, 0) < r(k, 2). 


However, for every k > 3 we still have h(k,@) > ¢'*°. 

This variant of the Ramsey problem is due to A. Hajnal. The construction 
of the graph G is geometrical: the vertices of G are points on an n-dimensional 
sphere with unit radius, and two points are joined if their Euclidean distance 
exceeds \/2k/(k — 1). 

Graphs defined by distances have been studied by many people (e.g., see 
[101]). The best constructive lower bound on r(3,n) is due to Alon [3] and 
gives r(3,n) > cn3/?. See also a remarkable elementary construction [12] 
(and also [16] which gives a weaker result). 


4. Ramsey Theory 


It seems that the building of a theory per se was never Erdés’s preference. 
He was a life-long problem solver, problem poser, admirer of mathematical 
miniatures and beauties. THE BOOK is an ideal. Instead of developing the 
whole field he seemed always to prefer consideration of particular cases. 
However, many of these cases turned out to be key cases and somehow 
theories emerged. 
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Nevertheless, one can say that Erdés and Rado systematically investi- 
gated problems related to Ramsey’s theorem with a clear vision that here 
was a new basis for a theory. In their early papers [42, 43] they investigated 
possibilities of various extensions of Ramsey’s theorem. It is clear that these 
papers are a result of a longer research and understanding of Ramsey’s 
theorem. As if these two papers summarized what was known, before Erdés 
and Rado went on with their partition calculus projects reflected by the 
grand papers [44] and [37]. But this is beyond the scope of this paper. Erdés 
and Rado [42] contains an extension of Ramsey’s theorem for colorings by 
an infinite number of colors. This is the celebrated Erdés-Rado canonization 
lemma: 


Theorem 5 ([42]). For every choice of positive integers p and n there exists 
N = N(p,n) such that for every set X, |X| > N, and for every coloring 


c: ) > N (i.e, a coloring by arbitrarily many colors) there exists an n- 


element subset Y of X such that the coloring c restricted to the set () is 
“canonical”. 


Here a coloring of () is said to be canonical if there exists an ordering 
Y =y1 <... < Yn and a subset w C {1,...,p} such that two n-sets {z < 


. < Zp} and {z, < --- < 2} get the same color if and only if 2, = z; 
for exactly 7 € w. Thus there are exactly 2? canonical colorings of p-tuples. 
The case w = ¢ corresponds to a monochromatic set while w = {1,...,p} 


to a coloring where each p-tuple gets a different color (such a coloring is 
sometimes called a “rainbow” or totally multicoloring). 

Erd6és and Rado deduced Theorem 5 from Ramsey’s theorem. For exam- 
ple, the bound N(p,n) < r(2p,2??,n) gives a hint as to how to prove it. 
One of the most elegant forms of this argument was published by Rado [103] 
in one of his last papers. 

The problem of estimating N(p,n) was recently attacked by Lefman 
and Rédl [80] and Shelah [113]. One can see easily that Theorem 5 implies 
Ramsey’s theorem (e.g., N(p,n) > r(p,n — 2,n)) and the natural question 
arises as to how many exponentiations one needs. In [80] this was solved 
for graphs (p = 2) and Shelah [113] solved recently this problem in full 
generality: N(p,n) is the lower function of the same height r(p,4,n) i.e., 
(p — 1) exponentiations. 

The Canonization Lemma found many interesting applications (see, e.g., 
[98]) and it was extended to other structures. For example, the canonical van 
der Waerden theorem was proved by Erdés and Graham [31]. 


Theorem 6 ([31]). For every coloring of positive integers one can find 
either a monochromatic or a rainbow arithmetic progression of every length. 
(Recall: a rainbow set is a set with all its elements colored differently.) 


This result was extended by Lefman [79] to all regular systems of linear 
equations (see also [21]) and in an extremal setting by Erdds et al. [38]. 
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One of the essential parts of the development of the “new Ramsey theory” 
age was the stress on various structural extensions and structure analogies 
of the original results. A key role was played by the Hales-Jewett theorem 
(viewed as a combinatorial axiomatization of van der Waerden’s theorem), 
Rota’s conjecture (the vector-space analogue of Ramsey’s theorem), Graham- 
Rothschild parameter sets, all dealing with new structures. These questions 
and results displayed the richness of the field and attracted so much attention 
to it. 

It seems that one of the significant turns appeared in the late 60s when 
Erdés, Hajnal and Galvin started to ask questions such as “which graphs 
contain a monochromatic triangle in any 2-coloring of its edges”. Perhaps 
the essential parts of this development can be illustrated with this particular 
example. 

We say that a graph G = (V,F) is t-Ramsey for the triangle (ie., 
K3) if for every coloring of E by t-colors, one of the colors contains a 
triangle. Symbolically we denote this by G + (K3)?. This is a variant of 
the Erdés-Rado partition arrow. Ramsey’s theorem gives us Kg > (K3)2 
(and K,.2,t,3) + (K3)?). But there are other essentially different examples. 
For example, a 2-Ramsey graph for K3 need not contain Kg. Graham 
[60] constructed the unique minimal graph with this property: The graph 
Kk3+ Cs (triangle and pentagon completely joined) is the smallest graph 
G with G > (K3)3 which does not contain a Kg. Yet K3 + Cs contains 
Ks and subsequently van Lint, Graham and Spencer constructed a graph 
G not containing even a K5, with G — (K3)3. Until recently, the smallest 
example was due to Irving [68] and had 18 vertices. Very recently, two more 
constructions appeared by Erickson [49] and Bukor [11] who found examples 
with 17 and 16 vertices (both of them use properties of Graham’s graph). 

Of course, the next question which was asked is whether there exists a 
K,4-free graph G with G — (K3)3. This question proved to be considerably 
harder and it is possible to say that it has not yet been solved completely 
satisfactorily. 

The existence of a Ky,-free graph G which is t-Ramsey for K3 was 
settled by Folkman [50] ( = 2) and NeSetzil and Rédl [94]. The proofs 
are complicated and the graphs constructed are very large. Perhaps just to 
be explicit Erdés [29] asked whether there exists a K4-free graph G which 
arrows triangle with fewer than 10!° vertices. This question proved to be 
very motivating and it was later shown by Spencer [118] that there exists 
such a graph with 3 x 10° vertices. More recently, it was shown by Lu [82] 
with the help of computers that such a graph exists with 9,697 vertices, and 
subsequently Dudek and Rédl reduced this number to 941. The record is 
currently held by Lange, Radziszowski and Xu [76] who found such a graph 
on just 786 vertices. Of course, it is possible that such a graph exists with 
fewer than 100 vertices! (In fact, one of the authors offers US$100 for the 
first person to find such a graph). However, for more than 2 colors the known 
K,-free Ramsey graphs are still astronomical. 
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Probabilistic methods were not only applied to get various bounds for 
Ramsey numbers. Recently, the Ramsey properties of the Random Graph 
K(n,p) were analyzed by Rédl and Rucirski and the threshold probability 
for p needed to guarantee K(n,p) + (K3)? with probability tending to 1 as 
n — oo, was determined (see [107]). 

Structural properties of Ramsey’s theorem have also been investigated. 
For example, the Erdés problems involving > aa where the sum is over 
homogeneous subsets of {1,2,...,n} and problems concerning the relative 
order of gaps of homogeneous sets were treated for graphs in [106, 18] (see also 
similar problems for ordering pigeonhole [10]). 

Many of these questions were answered in a much greater generality and 
this seems to be a typical feature for the whole area. On the other side 
these more general statements explain the unique role of the original Erdés 
problem. Let us be more specific. We need a few definitions: An ordered 
graph is a graph with a linearly ordered set of vertices (we speak about 
“admissible” orderings). Isomorphism of ordered graphs means isomorphism 
preserving admissible orderings. If A, B are ordered graphs (for now we will 
find it convenient to denote graphs by A, B,C,...) then (4) will denote the 
set of all induced subgraphs of B which are isomorphic to A. We say that a 
class K of graphs is Ramsey if for every choice of ordered graphs A, B from 
K there exists C € K such that C > (B)#. Here, the notation C — (B)4 
means: for every coloring c : (G) + {1,2} there exists B € ({) such that the 


set ) is monochromatic (see, e.g., [88].) Similarly we say that a class K of 
graphs is canonical if for every choice of ordered graphs A, B from K there 
exists C’ € K with the following property: For every coloring c : (5) 7+ N 
there exists B € (G) such that the set (%) has a canonical coloring. 

Denote by Forb(K;,) the class of all K,-free graphs. Now we have the 
following 


Theorem 7. For a hereditary class K of graphs the following statements are 
equivalent: 


1. K (with some admissible orderings) is Ramsey; 

2. K (with some admissible orderings) is canonical; 

3. K is a union of the following 4 types of classes: the class Forb( Ky), 
the class of complements of graphs from Forb(K,), the class of Turdén 
graphs (i.e., complete multipartite graphs) and the class of equivalences 
(i.e., complements of Turan graphs). 


(1. = 3. is proved in [87] establishing important connection of Ramsey 
classes and ultra homogeneous structures. 2. => 1. is easy, and one can prove 
1. = 2. directly along the lines of Erdds-Rado proof of the canonization 
lemma.) Thus, as often in Erdés’ case, the triangle-free graphs was not just 
any case but rather the typical case. 

From today’s perspective it seems to be just a natural step to consider. 
Ramsey properties of geometrical graphs. This was initiated in a series of 
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papers by Erdds, Graham, Montgomery, Rothschild, Spencer and Straus, 
32, 33, 34]. Let us call a finite configuration C of points in E” Ramsey if 
for every r there is an N = N(r) so that in every r-coloring of the points of 
7” a monochromatic congruent copy of C' is always formed. For example, the 
vertices of a unit simplex in E” is Ramsey (with N(r) = n(r—1)+n7), and it is 
not hard to show that the Cartesian product of two Ramsey configurations is 
also Ramsey. More recently, Frankl and Rédl [52] showed that any simplex in 
K” is Ramsey (a simplex is a set of 2 +1 points having a positive n-volume). 

In the other direction, it is known [32] that any Ramsey configuration 
must lie on the surface of a sphere (i.e., be “spherical” ). Hence, 3-collinear 
points do not form a Ramsey configuration, and in fact, for any such set Cs, 
‘N can always be 16-colored so as to avoid a monochromatic congruent copy 
of C3. It is not known if the value 16 can be reduced (almost certainly it 
can). The major open question is to characterize the Ramsey configurations. 
It is natural to conjecture that they are exactly the class of spherical sets. 
Additional evidence of this was found by Kriz [74] who showed for example, 
that the set of vertices of any regular polygon is Ramsey (see [85] for a positive 
answer to a weaker version). However, Leader, Russell and Walters [77] have 
a different conjecture as to which sets are Euclidean Ramsey sets. Let us call 
a finite set in Euclidean space subtransitive if it is a subset of a set which 
has a transitive automorphism group. They conjecture that the Euclidean 
Ramsey sets are exactly the subtransitive sets. These two conjectures are not 
compatible since they also show [78] that almost all 4-points subsets of a 
(unit) circle are not subtransitive. A fuller discussion of this interesting topic 
can be found in [61] and [62]. 


5. Adventures in Arithmetic Progressions 


Besides Ramsey’s theorem itself the following result provided constant 
motivation for Ramsey Theory: 


Theorem 8 (van der Waerden [124]). For every choice of positive 
integers k and n, there exists a least N(k,n) = N such that for every partition 
of the set {1,2,...,N} into k classes, one of the classes always contains an 
arithmetic progression with n terms. 


The original proof of van der Waerden (which developed through 
discussions with Artin and Schreier—see [125] for an account of the discovery) 
and which is included in an enchanting and moving book of Khinchine [71] 
was until recently essentially the only known proof. However, interesting 
modifications of the proof were also found, the most important of which is 
perhaps the combinatorial formulation of van der Waerden’s result by Hales 
and Jewett [65]. 

The distinctive feature of van der Waerden’s proof (and also of Hales- 
Jewett’s proof) is that one proves a more general statement and then uses 
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double induction. Consequently, this procedure does not provide a primitive 
recursive upper bound for the size of N (in van der Waerden’s theorem). 
On the other hand, the best bound (for n prime) is (only!) W(n +1) > n2”, 
n prime (due to Berlekamp [9]). Thus, the question of whether such a huge 
upper bound was also necessary, was and remains to be one of the main 
research problems in the area. In 1988, Shelah [112] gave a new proof of both 
van der Waerden’s and the Hales-Jewett’s theorem which provided a primitive 
recursive upper bound for N(k,n). However the bound was still very large, 
being of the order of fifth function in the Ackermann hierarchy—“tower of 
tower functions”. a 

Even for a proof of the modest looking conjecture N(2,n) < 22° where 
the tower of 2’s has height n, the first author of this paper offered $1,000. 
(He subsequently happily paid this reward to Tim Gowers for his striking 
improvement for upper bounds on the related function r,(n) which we define 
in the next section). The first author currently (foolishly?) offers $1,000 for 
a proof (or disproof) that N(2,n) < 2” for every n. 

The discrepancy between the general upper bound for van der Waerden 
numbers and the known values is the best illustrated for the first nontrivial 
value: while N(2,3) = 9, Gowers’ proof gives the bound 


24,096 


N(2,3)<2? ! 


These observations are not new and were considered already in the Erd6és and 
Turan 1936 paper [47]. For the purpose of improving the estimates for the van 
der Waerden numbers, they had the idea of proving a stronger—now called a 
density—statement. They considered (how typical!) the particular case of 3- 
term arithmetic progressions and for a given positive integer N, defined r(N) 
(their notation) to denote the maximum number elements of a sequence of 
numbers < N which does not contain a 3-term arithmetic progression. They 
observed the subadditivity of function r(V) (which implies the existence of 
a limiting value of r(N)/N) and proved r(N) < (2+€)N for all N > N(E). 

After that they remarked that probably r(N) = o(N). And in the last 
few lines of their paper, they define numbers rz,(N) to denote the maximum 
number of integers less than or equal to N such that no & of them form an 
arithmetic progression. Although they do not ask explicitly whether r,(N) = 
o(N) (as Erdés did many times since), this is clearly in their mind as they list 
consequences of a good upper bound for r;,(V): long arithmetic progressions 
formed by primes (yes, already there!) and a better bound for the van der 
Waerden numbers. 

As with the Erdés-Szekeres paper [46], the impact of the modest Erdés- 
Turan note [47] is hard to overestimate. Thanks to its originality, both in 
combinatorial and number theoretic contexts, and to Paul Erdés’ persistence, 
this led eventually to beautiful and difficult research, and probably beyond 
Erdés’ expectations, to a rich general theory. We wish to briefly mention 
some key points of this development where the progress has been remarkably 
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rapid, so that van der Waerden’s theorem with it many variations and related 
problems has become one of the fastest growing (and successful) areas in 
mathematics. It cannot be the purpose of this article (which concentrates 
narrowly on the work of Erdés) to survey this body of work (for a good 
start, see {119]). In particular, this development has lead to 2 Fields Medals 
(Gowers 2002, Tao 2006) and more recently, to an Abel Prize (Szemerédi 
2012). In particular, Gowers [58] gave a new bound for r,z(n) which as a 
consequence gave the strongest current upper bound for the van der Waerden 
function W(2,n) of the form 
gnt9 


a 
W(2,n) < 2?” 


o] 


thereby earning the above-mentioned $1,000 prize. (Strictly speaking, Gowers’ 
bound for W(2,n) is larger then required conjectured bound given by the 
tower of n 2’s for the values of n = 7 and 8 but it was judged to be close 
enough to deserve the full prize!) In addition, Green and Tao [64] proved 
the existence of arbitrarily long arithmetic progression of primes in any set 
of integers of positive upper density (thus solving a problem attributed to 
Legendre). Most of these advances were motivated by and more or less 
directly related to the Erdés-Turan function rz.(n). Soon after [47] good lower 
estimates for r(N) were obtained by Salem and Spencer [110] and Behrend [8] 
which still gives the best bounds. These bounds recently found a surprising 
application in a least expected area, namely in the fast multiplication of 
matrices (Coppersmith and Winograd [19]). 

The upper bounds and rz(N) = o(N) appeared to be much harder. 
In 1953 K. Roth [109] proved r3(.V) = o(V) and after several years of partial 
results, E. Szemerédi in 1975 [92] proved the general case 


rp(N) = o(N) for every k. 


This is generally recognized as the single most important solution of an 
Erdés problem, the problem for which he has paid the largest reward. By now 
there are more expensive problems (see Erdés’ article in these volumes) but 
they have not yet been solved. And taking inflation into account, possibly 
none of them will ever have as an expensive solution. Szemerédi’s proof 
changed Ramsey theory in at least two aspects. First, several of its pieces, 
most notably the so-called Regularity Lemma, proved to be very useful 
in many other combinatorial situations (see e.g., [15, 92, 107]). Secondly, 
perhaps due to the complexity of Szemerédi’s combinatorial argument, and 
the beauty of the result itself, an alternative approach was called for. Such 
an approach was found by Hillel Furstenberg [54, 55] and developed further 
in many aspects in his joint work with B. Weiss, Y. Katznelson and others. 
Let us just mention two results which in our opinion best characterize the 
power of this approach: In [56] Furstenberg and Katznelson proved the density 
version of Hales-Jewett theorem, and Bergelson and Leibman [7] proved the 
following striking result (conjectured by Furstenberg): 
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Theorem 9 ([7]). Let pi,...,pr be polynomials with rational coefficients 
taking integer values on integers and satisfying p;(0) = 0 fori = 1,...,k. 
Then every set X of integers of positive density contains for every choice of 
numbers v1,...,Uk, a subset 


wt pi(d)vr, w+ po(d)v2,.-.,# + pe(d)ve 
for some pw andd> 0. 


Choosing p;(z) = x and v; = 7 we get the van der Waerden theorem. 
Already, the case p;(z) = x? and v; = i was open for several years [111] 
(this gives long arithmetic progressions in sets of positive density with their 
differences being a square). 

Originally, none of these results was proved by combinatorial methods. 
Instead, they were all proved by a blend of topological dynamics and ergodic 
theory methods, proving countable extensions of these results. For this part of 
Ramsey theory this setting seems to be most appropriate. In some sense, this 
is a long way from the original Erdés-Turan paper. However, this emphasis 
as been changing recently with combinatorial proofs of many of the results 
in the area, most notably of the density version of the Hales-Jewett theorem 
(see [100]). 

And even more recently, the situation reversed as Rédl’s project of a 
combinatorial approach to Szemerédi’s theorem [105] using a hypergraph 
generalization of the regularity lemma was successful, see e.g., [59, 108]. This 
generalization in turn was related to model theory, probability and analysis, 
see e.g., recent papers [119, 120]. This development probably far exceeded 
even Erdés’ expectations. 

Let us close this section with a very concrete and still unsolved example. 
In 1983, G. Pisier [99] formulated (in a harmonic analysis context) the 
following problem: A set of integers 71 < xg < ... is said to be independent if 
all finite subsums of distinct elements are distinct. Now let X be an infinite 
set and suppose for some € > 0 that every finite subset Y C X contains a 
subsubset Z of size > €|Z| which is independent. Is it then true that X is a 
finite union of independent sets? 

Despite much effort and partial solutions, the problem is still open. It was 
again Paul Erdés who quickly realized the importance of the Pisier problem 
and as a result, Erdés, NeSetiil and Rodl [39, 40] studied “Pisier type 
problems”. For various notions of an independence relation, the following 
question was considered: Assume that an infinite set X satisfies for some 
€ > 0, some hereditary density condition (i.e., we assume that every finite set 
Y contains an independent subsubset of size > €/Y|). Is it then true that X 
can be partitioned into finitely many independent sets? 

Positive instances (such as collinearity, and linear independence) as well 
as negative instances (such as Sidon sets) were given in [39, 40]. Also various 
“finitization versions” and analogues of the Pisier problem were answered in 
the negative. But at present the original Pisier problem is still open. In a 
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way one can consider Pisier type problems as dual to the density results in 
Ramsey theory: One attempts to prove a positive Ramsey type statement 
under a strong (hereditary) density condition. This is exemplified in [40] by 
the following problem which is perhaps a fitting conclusion to this paper 
surveying 60 years of Paul Erdés’ service to Ramsey theory. 


The Anti-Szemerédi Problem [40] 


Does there exist a set X of positive integers such that for some € > 0 the 
following two conditions hold simultaneously: 


(1) For every finite Y C X there exists a subset Z C X, |Z| > €|Y|, which 
does not contain a 3-term arithmetic progression; 

(2) Every finite partition of X contains a 3-term arithmetic progression in one 
of its classes. 
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Shadows of Memories (Ramsey Theory, 1984) 
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From left to right: B. L. Rothschild, W. Deuber, P. Erdés, 
B. Voigt, H.-J. Promel, R. L. Graham, J. NeSetiil, V. Rédl. 


Ramsey Theory in the Work of Paul Erdés 189 


References 


1. 
2: 
3. 
4. 
5. 


6. 
. V. Bergelson and A. Leibman, Polynomial extensions of van der Waerden’s 


23. 


24. 


M. Ajtai, J. Komléds and E. Szemerédi, A dense infinite Sidon sequence, 
European J. Comb. 2 (1981), 1-11. 

N. Alon, Eigenvalues, geometric expanders, sorting in rounds and Ramsey 
theory, Combinatorica 3 (1986), 207-219. 

N. Alon, Explicit Ramsey graphs and orthonormal labelings, Electron. J. Com- 
bin. 1, R12 (1994), (8pp). 

N. Alon, Discrete Mathematics: Methods and Challenges. In: Proc. of the ICM 
2002, Bejing, China Higher Education Press 2003, 119-141. 

N. Alon, V. Rédl, Sharp bounds for some multicolour Ramsey numbers, 
Combinatorica 25 (2005), 125-141. 

N. Alon and J. Spencer, The Probabilistic Method, Wiley, New York, 1992. 


and Szemerédi’s theorems, J. Amer. Math. Soc. 9 (1996), 725-753. 


. F. A. Behrend, On sets of integers which contain no three in arithmetic 


progression, Proc. Nat. Acad. Sci. 23 (1946), 331-332. 


. E. R. Berlekamp, A construction for partitions which avoid long arithmetic 


progressions, Canad. Math. Bull 11 (1968),409-414. 


. B. Bukh, and J. MatouSek, Erdés-Szekeres-type statements: Ramsey functions 


and decidability in dimension 1, arXiv: 1207.0705 v1 (3 July 2012). 


. J. Bukor, A note on the Folkman number F(3,3,5), Math. Slovaka 44 (4), 


(1994), 479-480. 


. F. R. K. Chung, R. Cleve, and P. Dagum, A note on constructive lower bound 


for the Ramsey numbers R(3,t), J. Comb. Theory 57 (1993), 150-155. 


.F. R. K. Chung, R. L. Graham and R. M. Wilson, Quasirandom graphs, 


Combinatorica 9 (1989), 345-362. 


. V. Chvatal and J. Komlos, Some combinatorial theorems on monotonicity, 


Canad. Math. Bull. 14, 2 (1971). 


. V. Chvatal, V. Rédl, E. Szemerédi, and W. Trotter, The Ramsey number of 


graph with bounded maximum degree, J. Comb. Th. B, 34 (1983), 239-243. 


. B. Codenotti, P. Pudlak and J. Resta, Some structural properties of low rand 


matrices related to computational complexity, Theoret. Comp. Sci. 235 (2000), 
89-107. 


. D. Conlon, A new upper bound for diagonal Ramsey numbers, Ann. Math. 


170 (2009), 941-960. 


. D. Conlon, J. Fox and B. Sudakov, Two extensions of Ramsey’s theorem 


(to appear in Duke Math. Jour). 


. D. Coppersmith and S. Winograd, Matrix multiplication via arithmetic 


progressions, J. Symb. Comput. 9 (1987), 251-280. 


. Blanche Descartes, A three colour problem, Eureka 9 (1947), 21, Eureka 10 


(1948), 24. (See also the solution to Advanced problem 1526, Amer. Math. 
Monthly 61 (1954), 352.) 


. W. Deuber, R. L. Graham, H. J. Prémel and B. Voigt, A canonical partition 


theorem for equivalence relations on Z*, J. Comb. Th. (A) 34 (1983), 331-339. 


. G. A. Dirac, The structure of k-chromatic graphs, Fund. Math. 40 (1953), 


42-55. 

A. Dudek and V. Rédl, On the Folkman number f (2,3, 4), Experimental Math. 
17 (2008), 63-67. 

P. Erdés, On sequences of integers no one of which divides the product of two 
others and on some related problems, Izv. Nanc. Ise. Inset. Mat. Mech. Tomsrk 
2 (1938), 74-82. 


190 


25 


26. 
27. 


28. 
29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


Ron L. Graham and Jaroslav NeSetril 


. P. Erdés, Some remarks on the theory of graphs, Bull. Amer. Math. Soc. 53 
(1947), 292-294. 

P. Erdés, Problems and results in additive number theory, Colloque sur la 
Theorie des Numbres, Bruxelles (1955), 127-137. 

P. Erdés, Graph theory and probability, Canad. J. Math. 11 (1959), 34-38. 
P. Erdés, Graph theory and probability II, Canad. J. Math. 13 (1961), 346-352. 
P. Erdés, Problems and result on finite and infinite graphs, In: Recent 
Advances in Graph Theory (ed. M. Fiedler), Academia, Prague (1975), 183- 
192. 

P. Erdés, Art of Counting, MIT Press, 1973. 

P. Erdés and R. L. Graham, Old and New Problems and Results in 
Combinatorial Number Theory, L’ Enseignement Math. 28 (1980), 128 pp. 

P. Erdoés, R. L. Graham, P. Montgomery, B. L. Rothschild, J. H. Spencer, 
and E. G. Straus, Euclidean Ramsey Theorem, J. Combin. Th. (A) 14 (1973), 
341-363. 

P. Erdoés, R. L. Graham, P. Montgomery, B. L. Rothschild, J. H. Spencer, 
and E. G. Straus, Euclidean Ramsey Theorems II, In A. Hajnal, R. Rado and 
V. Sé6s, eds., Infinite and Finite Sets I, North Holland, Amsterdam, 1975, pp. 
529-557. 

P. Erdoés, R. L. Graham, P. Montgomery, B. L. Rothschild, J. H. Spencer, 
and E. G. Straus, Euclidean Ramsey Theorems III, In A. Hajnal, R. Rado 
and V. Sés, eds., Infinite and Finite Sets II, North Holland, Amsterdam, 1975, 
pp. 559-583. 

P. Erdés and A. Hajnal, On chromatic number of set systems, Acta Math. 
Acad. Sci. Hungar. 17 (1966), 61-99. 

P. Erdés and A. Hajnal, Some remarks on set theory IX, Mich. Math. J. 11 
(1964), 107-112. 

P. Erdés, A. Hajnal and R. Rado, Partition relations for cardinal numbers, 
Acta Math. Hungar. 16 (1965), 93-196. 

P. Erdés, J. NeSetyil and V. Rédl, Selectivity of hypergraphs, Colloq. Math. 
Soc. Janos Bolyai 37 (1984), 265-284. 

P. Erdés, J. NeSet7il, and V. Rédl, On Pisier Type Problems and Results 
(Combinatorial Applications to Number Theory). In: Mathematics of Ramsey 
Theory (ed. J. NeSetfil and V. Rédl), Springer Verlag (1990), 214-231. 

P. Erdés, J. NeSetril, and V. Rédl, On colorings and independent sets (Pisier 
Type Theorems) (preprint). 

P. Erdés and R. Rado, A construction of graphs without triangles having 
preassigned order and chromatic number, J. London Math. Soc. 35 (1960), 
445-448. 

P. Erdés and R. Rado, A combinatorial theorem, J. London Math. Soc. 25 
(1950), 249-255. 

P. Erdés and R. Rado, Combinatorial theorems on classifications of subsets of 
a given set, Proc. London Math. Soc. 3 (1951),417—439. 

P. Erdés and R. Rado, A partition calculus in set theory, Bull. Amer. Math. 
Soc. 62 (1956), 427-489. 

P. Erdés and C. A. Rogers, The construction of certain graphs, Canad. J. 
Math. (1962), 702-707. 

P. Erdés and G. Szekeres, A combinatorial problem in geometry, Composito 
Math. 2 (1935), 464-470. 

P. Erdés and P. Turan, On some sequences of integers, J. London Math. Soc. 
11 (1936), 261-264. 

P. Erdés and P. Turan, On a problem of Sidon in additive number theory and 
on some related problems, J. London Math. Soc. 16 (1941), 212-215. 


Ramsey Theory in the Work of Paul Erdéds 191 


49 


50. 


ol. 


52. 


53. 


54. 


55. 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


. M. Erickson, An upper bound for the Folkman number F'(3, 3,5), J. Graph 
Th. 17 (6), (1993), 679-68. 

J. Folkman, Graphs with monochromatic complete subgraphs in every edge 
coloring, SIAM J. Appl. Math. 18 (1970), 19-24. 

P. Frankl, A constructive lower bound for Ramsey numbers, Ars Combinatorica 
2 (1977), 297-302. 

P. Frankl and V. Rédl, A partition property of simplices in Euclidean space, 
J. Amer. Math. Soc. 3 (1990), 1-7. 

P. Frankl and R. M. Wilson, Intersection theorems with geometric conse- 
quences, Combinatorica 1 (1981), 357-368. 

H. Furstenberg, Ergodic behavior of diagonal measures and a theorem of 
Szemerédi on arithmetic progressions, J. Anal. Math. 31 (1977), 204-256. 

H. Furstenberg, Recurrence in Ergodic Theory and Combinatorial Number 
Theory, Princeton Univ. Press, Princeton, 1981. 

H. Furstenberg and Y. Katznelson, A density version of the Hales-Jewett 
theorem, J. Analyze Math. 57 (1991), 61-85. 

T. Gerken, On empty convex hexagons in planar point sets, In: J. Goodman, 
J. Pach and R. Pollack (eds.), Twentieth Anniversary Volume, Disc. Comput. 
Geom, Springer, 2008, 1-34. 

W. T. Gowers, A new proof of Szemerédi’s theorem, Geom. Funct. Anal 11 
(2001), 465-588. 

W. T. Gowers, Hypergraph regularity and the multidimensional Szemerédi 
theorem, Ann. Math. 166 (2007), 897-946. 

R. L. Graham, On edgewise 2-colored graphs with monochromatic triangles 
and containing no complete hexagon, J. Comb. Th. 4 (1968), 300. 

R. L. Graham, Euclidean Ramsey theory, in Handbook of Discrete and 
Computational Geometry, J. Goodman and J. O’Rourke, eds., CRC Press, 
1997, 153-166. 

R. L. Graham, Open problems in Euclidean Ramsey theory, Geombinatorics 
13 (2004), 165-177. 

R. L. Graham, B. L. Rothschild, and J. Spencer, Ramsey theory, Wiley, 1980, 
2nd edition, 1990. 

B. Green and T. Tao, The primes contain arbitrarily long arithmetic progres- 
sions, Ann. Math. 167 (2008), 481-547. 

A. W. Hales and R. I. Jewett, Regularity and positional games, Trans. Amer. 
Math. Soc. 106 (1963), 222-229. 

P. Hell and R. L. Graham, On the history of the minimum spanning tree 
problem, Annals of Hist. Comp. 7 (1985), 43-57. 

D. Hilbert, Uber die irreducibilitat ganzer rationaler funktionen mit ganzzahli- 
gen koefficienten, J. Reine und Angew. Math. 110 (1892), 104-129. 

R. Irving, On a bound of Graham and Spencer for a graph-coloring constant, 
J. Comb. Th. B, 15 (1973), 200-203. 

V. Jarnik and M. Kossler, Sur les graphes minima, contenant n points donnés, 
Cas. Pést. Mat. 63 (1934), 223-235. 

J. B. Kelly and L. M. Kelly, Paths and circuits in critical graphs, Amer. 
J. Math. 76 (1954), 786-792. 

A. J. Khinchine, Drei Perlen der Zahlen Theorie, Akademie Verlag, Berlin 1951 
(reprinted Verlag Harri Deutsch, Frankfurt 1984). 

J. H. Kim, The Ramsey number R(3,t) has order of magnitude ¢?/logt, 
Random Structures and Algorithms 7 (1995), 173-207. 

V. A. Koshelev, The Erdés-Szekeres problem, Dokl. Akad. Nauk.,415 (2007), 
734-736. 


192 


TA. 


75. 


76. 


77. 


78. 


79. 


80. 


81. 


82. 


83. 


84. 


85. 


86. 
87. 


88. 


89. 


90. 


91. 


92. 


93. 


94. 


95. 


96. 


97. 


98. 


99. 


Ron L. Graham and Jaroslav NeSetril 


I. Kz, Permutation groups in Euclidean Ramsey theory, Proc. Amer. Math. 

Soc. 112 (1991), 899-907. 

G. Kun, Constraints, MMSNP and expander relational structures, arXiv: 

0706.1701 (2007). 

A. Lange, S. Radziszowski, X. Xu, Use of MAX-CUT for Ramsey arrowing of 

triangles, arXiv:1207.3750 (Jul. 16, 2012). 

I. Leader, P. Russell and M. Walters, Transitive sets in Euclidean Ramsey 

theory, J. Comb. Th. (A), 119 (2012), 382-396. 

I. Leader, P. Russell and M. Walters, Transitive sets and cyclic quadrilaterals, 

J. Comb. 2 (2011), 457-462. 

H. Lefman, A canonical version for partition regular systems of linear 

equations, J. Comb. Th. (A) 41 (1986), 95-104. 

H. Lefman and V. Rodl, On Erdés-Rado numbers, Combinatorica 15 (1995), 

85-104. 

L. Lovasz, On the chromatic number of finite set-systems, Acta Math. Acad. 

Sci. Hungar. 19 (1968), 59-67. 

L. Lu, Explicit construction of small Folkman graphs, SIAM J. on Disc. Math. 
1 (2008), 1053-1060. 

A. Lubotzky, R. Phillips, and P. Sarnak, Ramanujan Graphs, Combinatorica 

8(3) (1988) ,261-277. 

G. A. Margulis, Explicit constructions of concentrators, Problemy Peredachi 

Informatsii 9, 4 (1975), 71-80. 

J. Matougek, and V. Rédl, On Ramsey sets in spheres, J. Comb. Th. (A) 70 

(1995), 30-44. 

J. Mycielski, Sur le coloriage des graphes, Colloq. Math. 3 (1955), 161-162. 

J. NeSetiil: For graphs there are only four types of hereditary Ramsey classes, 

J. Comb. Th. B 46 (1989), 127-132. 

J. NeSetyil, Ramsey Theory, In: Handbook of Combinatorics, North Holland 

(1995), 1331-1403. 


J. NeSetril, H. NeSetiilova, The origins of minimal spanning tree algorithms, 
Bortvka, Jarnfk, Dokumenta Math.(2012), 127-141. 

J. Neset#il and V. Rédl, A probabilistic graph theoretical method, Proc. Amer. 
Math. Soc. 72 (1978), 417-421. 

J. NeSetiil and V. Rédl, A short proof of the existence of highly chromatic 
graphs without short cycles, J. Combin. Th. B, 27 (1979), 225-227. 

J. NeSet7il and V. Rodl, Partition theory and its applications, in Surveys in 
Combinatorics, Cambridge Univ. Press, 1979, pp. 96-156. 

J. NeSet¢il and V. Rédl, Two proofs in combinatorial number theory, Proc. 
Amer. Math. Soc. 93, 1 (1985), 185-188. 

J. NeSetril and V. Rédl, Type theory of partition properties of graphs, In: 
Recent Advances in Graph Theory (ed. M. Fiedler), Academia, Prague (1975), 
405-412. 

J. NeSeti#il and P. Valtr, A Ramsey-type result in the plane, Combinatorics, 
Probability and Computing 3 (1994), 127-135. 

C. M. Nicolds, The empty hexagon theorem, Disc. Comput. Geom. 38 (2) 
(2007), 389-397. 

M. Overmars, Finding sets of points without empty convex 6-gons, Disc. 
Comput. Geom. 29 (2003), 153-158. 

J. Pelant and V. R6odl, On coverings of infinite dimensional metric spaces. 
In Topics in Discrete Math., vol. 8 (ed. J. NeSet#il), North Holland (1992), 
75-81. 

G. Pisier, Arithmetic characterizations of Sidon sets, Bull.Amer. Math.Soc. 8 
(1983), 87-89. 


Ramsey Theory in the Work of Paul Erdés 193 


100 


101. 


102. 


103. 


104. 


105. 
106. 


107. 


108. 


109. 
110. 


111. 


112. 


113. 


114. 


115. 


116. 


117. 


118. 


119. 


120. 


121. 


122. 


123. 


124. 


125. 


126. 


D. H. J. Polymath, A new proof of the density Hales-Jewett theorem, Ann. 
Math. 175 (2012), 1283-1327. 

D. Preiss and V. Rédl, Note on decomposition of spheres with Hilbert spaces, 
J. Comb. Th. A 43 (1) (1986) ,38—44. 

R. Rado, Studien zur Kombinatorik, Math. Zeitschrift 36 (1933), 242-280. 

R. Rado, Note on canonical partitions, Bull. London Math. Soc. 18 (1986), 
123-126. Reprinted: Mathematics of Ramsey Theory (ed. J. NeSetril and V. 
Réodl), Springer 1990, pp. 29-32. 

F. P. Ramsey, On a problem of formal logic, Proc. London Math. Soc. 48 
(1930), 264-286. 

V. Rodl, Some developments in Ramsey theory, in ICM 1990 Kyoto, 1455-1466. 
V. Rédl, On homogeneous sets of positive integers, J. Comb, Th. (A) 102 
(2003), 229-240. 

V. Rédl and A. Ruciriski, Threshold functions for Ramsey properties, J. Amer. 
Math. Soc. 8 (1995), 917-942. 

V. Rodl, J. Skokan, Regularity lemma for k-uniform hypergraphs, Random 
Structures and Algor. 25 (2004), 1-42. 

K. Roth, On certain sets of integers, J. London Math. Soc. 28 (1953), 104-109. 
R. Salem and D. C. Spencer, On sets of integers which contain no three terms 
in arithmetic progression, Proc. Nat. Acad. Sci. 28 (1942), 561-563. 

A. Sarkézy, On difference sets of integers I, Acta Math. Sci. Hungar. 31 (1978), 
125-149. 

S. Shelah, Primitive recursive bounds for van der Waerden numbers, J. Amer. 
Math. Soc. 1 (1988), 683-697. 

S. Shelah, Finite canonization, Comment.Math.Univ.Carolinae 37,3 (1996) 
445-456. | 

I. Schur, Uber die Kongruens 2™ + y™ = 2™( mod p), Jber. Deutch. Math. 
Verein. 25 (1916), 114-117. 

I. Schur, Gesammelte Abhandlungen (eds. A. Brauer, H. Rohrbach), 1973, 
Springer. 

S. Sidon, Ein Satz tiber trigonometrische Polynome und seine Anwendungen 
in der Theorie der Fourier-Reihen, Math. Ann. 106 (1932), 539. 

J. H. Spencer, Ramsey’s theorem — a new lower bound, J. Comb. Th. A, 18 
(1975), 108-115. 

J. H. Spencer, Three hundred million points suffice, J. Comb. Th. A 49 (1988), 
210-217. 

T. Tao, The ergodic and combinatorial approaches to Szemerédi’s theorem, In: 
CRM Roceedings & Lecture Notes 43, AMS (2007), 145-193. 

T. Tao and V. Vu, Additive Combinatorics, Cambridge Univ. Press, 2006, 
Xvili+512 

A. Thomason, An upper bound for some Ramsey numbers, J. Graph Theory 
12 (1988), 509-517. 

A. Thomason, Random graphs, strongly regular graphs and pseudorandom 
graphs, In: Survey in Combinatorics, Cambridge Univ. Press (1987), 173-196. 
P. Valtr, Convex independent sets and 7-holes in restricted planar point sets, 
Disc. Comput. Geom. 7 (1992), 135-152. 

B. L. van der Waerden, Beweis einer Baudetschen Vermutung, Nieuw. Arch. 
Wisk. 15 (1927), 212-216. 

B. L. van der Waerden, How the proof of Baudet’s Conjecture was found, 
in Studies in Pure Mathematics (ed. L. Mirsky), Academic Press, New York, 
1971, pp. 251-260. 

A. A. Zykov, On some properties of linear complexes, Math. Sbornik 66, 24 
(1949), 163-188. 


Memories on Shadows and Shadows 
of Memories 


Gyula O. H. Katona 


G.O.H. Katona (5) 
Rényi Institute, Mathematical Institute, Hungarian Academy of Sciences, 
1364 Budapest, Pf. 127, Hungary 

e-mail: ohkatona@renyi.hu 


Iam one of the very few mathematicians who knew Paul’s aunt Irma before 
I knew him. She and my grandmother were neighbors during World War II. 
Aunt Irma was one of the few Jews in Budapest who survived the holocaust. 
This is how I met her since I was raised from this time by my grandmother and 
my aunt. Aunt Irma must have had good memories about my grandmother 
since they kept a good relationship, she regularly visited my grandmother 
even after her move to another place. She learned about my interest in 
mathematics and suggested I meet her nephew Pali who happened to be 
a mathematician. “Of course” I had never heard of him, but I was very glad 
to meet an old “real mathematician”. He was a very respectful old man (46!). 
I immediately understood that I was seeing an extraordinary personality. 
He gave me three problems to solve. I remember one of them. 


How many numbers can we choose from the set {1,2,...,2n} without 
having a number and its proper divisor in the set? 


He probably had in his mind that I should call him the next day with the 
solutions. I took the problems very seriously. But only with my pace. It was 
summer. The summer between high school and university studies. I had to 
do so much. But I regularly returned to the problems, I solved all of them by 
September and reported the solutions to him. 

Next time when Aunt Irma visited us forwarded the following message: 


He is probably very talented in other fields but not so much in 
mathematics. 


It did not touch me deeply. I had a lot of self-confidence. I won the 
National Olympiad in mathematics for high school students and was a 
member of the Hungarian team at the first International Olympiad. But 
I should have taken his opinion more seriously! At least concerning my pace 
and assertiveness. Is it too late? 

I started my studies. Beside my regular classes, I attended several special 
lectures and seminars. Turdn’s seminar proved to be the most important for 
me. In spite of his negative (perhaps non-positive) opinion on my abilities, 
Uncle Paul (in Hungarian: Pali bacsi (=Pauli baachy)) did not forget about 
me. He handed me a reprint of the famous Erdés-Ko-Rado paper [1], saying 
that there are some open problems in it. I was in my second year of studies, 
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Thad a joint result with my friend Domokos Szasz, but no one was too much 
interested in it because the problem was posed by ourselves. So I started to 
read the paper which was not without difficulties: I did not know English. 
The influence of this paper on my mathematical life was decisive. 

Let me remind the reader what the main theorem of this paper was. 


If we have a family of k element subsets of an n-element set, 2k >n 


and any two sets meet then the size of the family is at most (= 


This theorem became one of the centers of my interest. Much later, in 
1971, I found an elegant proof of it [5]. The open problem I started to work 
on was the following. 


Determine the largest family of subsets of an n-element set if any 
two of the sets meet in at least | elements. 


I spent all my free time in a period of 3 or 4 months thinking on this 
problem. Knowing my pace, this is not so much! Let me show you what I 
observed. 

For sake of simplicity consider only the case when | = 2 and n is even. 
The conjectured optimal family for this case was the family of all subsets 
having at least une elements. A family satisfying the conditions cannot 
contain the empty set or a one-element set. The total number of the 2-element 
and n — 1-element sets is maximum when all n — 1-element ones are chosen. 
The total number of the 3-element and n — 2-element sets is maximum when 
all n — 2-element ones are chosen. In general, it seemed that the total number 
of the z-element and n —7+ l-element sets was maximum when all n—7+1- 
element ones are chosen. That is, if we have m sets of size 7 in the family 
then they push out at least m sets of sizen —i+1. A set of size 7 pushes out 
exactly the complements of its i — 1-element subsets. So, it would be enough 
to prove that the number of 7 — 1-element subsets of 7-element members of 
the family is at least m. 

Let us repeat this more formally. Let A be the family satisfying the 
condition of the problem and let A; be the subfamily of its i-element members. 

The shadow (this name was introduced later by someone else) of A; is 


o(A;) ={B: |B] =i-1,BC Ac Ah. 


We need |.A;| < |o(A;)|. After several months of work I managed to prove a 
somewhat stronger statement: 


If any two members of the family A; meet in at least 2 elements then 
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The left hand side is 4, therefore (1) implies that the size of the shadow 
is larger than the size of the original family. This gives the solution of the 
problem. The case of general | is much the same [3]. 

I was extremely happy and reported the result to Uncle Paul (that time 
Professor). He seemed to be satisfied and mentioned some consequences of 
“my theorem”. Moreover, he invented an unusual reward for me. He invited 
me for lunch in a very nice hotel. The name of the hotel was Red Star (V6rés 
Csillag). It was on the top of a larger hill. Its restaurant was partially open air 
and had a fantastic view on Budapest. Of course, his mother (Anyuka) was 
with us. I was on one hand very proud on the other hand embarrassed. This 
was the first time I ate in a restaurant. Uncle Paul probably did understand 
my embarrassment and helped me to choose the food. 

Equation (1) determined the minimum of the ratio of the size of the 
shadow and the original family. It is easy to see that this estimate is sharp 
when the size of the family is eure However it is not sharp if the number of 
sets is different. It was disturbing that I could not determine the minimum of 
|o(A;)| when |.A;| was given. However I did not see any nice construction that 
could be conjectured to be the optimum. Then I realized that the problem 
makes sense without the intersection-property, too. 

From this time (about 1962) I concentrated on this problem and after 2 or 
3 years I found a complicated inductional solution. I presented my theorem in 
1965 in a lecture organized by the Janos Bolyai Mathematical Society. This 
became my best known result [4]. Let me formulate it in a special case, only. 


The minimum of |o(A;)| under the condition that |A;| = (%) for 
some fixed integer a is (ae The optimal construction is the family 
of all i-element subsets of an a-element set. 


Although it was not asked or conjectured by Erdés, it was an indirect 
consequence of Uncle Paul giving me his reprint. Later Branko Griinbaum 
called my attention to the fact that Kruskal [6] proved the same theorem 
earlier. His motivation was coding, therefore the combinatorial world was 
not aware of his result. Both (very different) proofs were quite lengthy. Many 
authors tried to find simpler proofs. Probably the shortest one is due to 
Frankl [2]. 

A young student from Greifswald (that time German Democratic 
Republic) spent a few months in Budapest in 1990 as an exchange student. 
I suggested to him to think about the following problem. What is the 
minimum size of the shadow if |A;| is fixed, like before, and the family A; 
has a system of distinct representatives, that is, one can find elements in each 
member of A; in such a way that these elements are distinct. (More precisely, 
I asked a somewhat different problem and he found this nicer variant.) After 
3 years of working on it he found the solution what I would formulate here 
only in a special case. 
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Theorem 1 (Leck [7]). Suppose that |.A;| is fixed, is of the form m(i+ 1) 
and A; has a system of distinct representatives. The minimum. of |a(A,)| is 
m(°S!) under these conditions. The optimal construction is a disjoint union 
of m(i + 1)-element sets containing all i-element subsets. 


The underground stream of problems started by Uncle Paul has reached 
Uwe Leck. But it is my responsibility to find a reward for him. Should I 
take him to Hotel Red Star? Its name is Golf now. And Uwe Leck has since 
become a “rich (West-)German”. I do not think lunch in a restaurant would 
be a great experience for him. Any suggestions? 
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Summary. P. Erdés and R. Rado defined a A-system as a family in which every 
two members have the same intersection. Here we obtain a new upper bound of the 
maximum cardinality y(n) of an n-uniform family not containing any A-system of 
cardinality 3. Namely, we prove that for any a > 1, there exists C = C(a) such 
that for any n, 


y(n) < Cnla~™. 


1. Introduction 


P. Erdés and R. Rado [2] introduced the notion of a A-system. They called a 
family H of finite sets a A-system if every two members of H have the same 
intersection. 

Let y(n) (respectively, y(n)) denote the the maximum cardinality of an 
n-uniform family (respectively, intersecting n-uniform family) not containing 
any A-system of cardinality 3. 

P. Erdés and R. Rado [2] proved that 


2”n! > y(n) > 2” 
and conjectured that 
y(n) <c” for some absolute constant c. 


The best published upper bound for y(n) is due to J. Spencer [3]: 


[4 


p(n) < eo nl, 


Z. Firedi and J. Kahn (see [1]) proved that 
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y(n) < eV" nl. 
The aim of the present paper is to prove 


Theorem 1. For any integer a > 1, there exists C = C(a) such that for 
any n, 


p(n) < Cnla~”. 


We follow here the ideas of J. Spencer [3]. In particular, in the course of 
proofs some inequalities are true if n is large in comparison with a. And we 
choose C' so that the statement of the theorem holds for smaller n. 


Remark 1. Certainly, since the statement is true for any constant a, it is 
also true for some function on n tending to infinity. In fact, along the lines 
of the proof of the theorem one can prove that there exists a positive constant 
C such that 


30 log log log ") a 


< Cn! 
e(n) < Cn ( log log n 


(All the logarithms throughout the paper are taken to the base e.) It is enough 
to change slightly Lemma 2 and to take k = | =>, (log log n)?| in Sect. 3. 


2. Preliminary Lemmas 


Call a family F of sets a (3,n,k)-family if it is an n-uniform family not 
containing any A-system of cardinality 3 such that the cardinality of the 
intersection of each two members of F is at most n — k. 


Lemma 1. For any (3,n, k)-family Ff, 
! 
n—k+1 n. 
|F| <2 " 


Proof. We use induction on n—k. Obviously, any (3, k, &)-family has at most 
two members. Hence the lemma is true for n —k = 0. 

Let the lemma be valid for n—k < m—1 and F be a (3, m+k, k)-family. 
Choose in F two edges A; and Ag with minimum cardinality of intersection 
and denote Z = A; U Ag. Then each A € F has a non-empty intersection 
with Z. 

For any x € Z, let F(z) = {A € F|x € A}, F(x) = {A\ {a}|A € F(a)}. 
Then for any x € Z, F(x) is a (3,m+k—1,k)-family. Thus, 


(m—1+k)! (m +k)! 
ne ees 


< gmt+l 
k! ~ k! 


IF < So |F(@) < |Z|2” 


TEL 


Lemma 2. If y(k) < Ckla~*e~° for any k <n, then y(n) < Cnla~”. 
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Proof. Let F be a (3,n,1)-family with |F| = y(n) and A € F. For any 
X CA, the family F(A, X) = {B\X|Be F&BNA= X} isa (3,n—|X], 1)- 
family. Moreover, if for some X C A, there exist two disjoint sets B, \ X, 
By. \ X € F(A,X) then A, By and By form a A-system. Hence F(A, X) is 
an intersecting family and 


ns FAs O({)e- )<Cnb™ 80% <cnla™. 


XCA i=0 
From now on, we suppose that for each m <n—-1, 
y(m) < Cmla~™e~%, (1) 
p(m) < Cmla~™. (2) 


In view of Lemma 2 it is enough to show that (1) holds for m =n. 

The following observation from [3] will be used throughout the paper. 
Let B,,...,B, be disjoint finite sets and F be a (3,n,1)-family such that 
|AN B;| > b; for each A € F. Then 


F< i ae (etn — bs sieouhy. (3) 


Lemma 3. Let 0 < r < k < n/2 and for any members Aj,..., A, of a 
(3, n, 1)-family F, 


|A, U...U A,| < rn — kr? /2. (4) 
Then 
n! 
|FI < CT 


Proof. For r = 1 the lemma is valid since (4) is impossible as r = 1. Suppose 
that the lemma is true for r < s —1 and |F| > Cn!/k!. By the induction 
hypothesis there exist A;,...,As—1 © F such that for the set B = A, U...U 
A._1 we have |B| > (s — 1)n — k(s — 1)*/2. If the lemma does not hold for 
F then for any A € F, 

|AN B| > n+ ((s —1)n — k(s — 1)?/2) — (sn — ks? /2) = k(s — 1/2), 


and there is ani, 1 <i < s—1 such that |AN A;| > k. Thus by (3), 
IFi<(s-0)(,) ,)e@-k-D< 
= Raaye ~ 
1 n! 


n gn: 
= _— pig rn tktl — Anig tet ni 
(s n(," Jew k—1)la Cnla kan! < eax 


Lemma 4. Let € > 2,1 <t <s <n and F be a (3,8,1)-family with 
|F| > Csl€-*. Then there exist F’ C F and X such that 
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(1) |X| = 8-1; 
(2) For any AE F', AD X; 
(3) |F'| > Ct!e-*, where B = (4€)8/*. 


Proof. Case 1. For any AEF, |{BeF||BNA| > s—t}| < Ctl258-* —- 1. 
Then a simple greedy algorithm gives us a (3, s,¢+ 1)-subfamily of F with 
cardinality at least 

|F| si(4g)° 558! 


CtlasB-ttN(2e)s tN 
But the existence of such a big (3, s,¢ + 1)-family contradicts Lemma 1. 
Case 2. There exists A € F such that |{B € F| |BN A| > s—t}| = 
[Ct!2°8-*]. 
Then for some X C A with |X| = s —t we have 


-1 
{Be F|BNAD X}| > [Ctl28873 (, - ,) > CHB". 


This is the family we need. 


3. Main Construction 


Let F be an intersecting (3,7, 1)-family with |F| = y(n). The idea is to find 
a (not too large) family of collections of disjoint (and considerably small) 
sets such that each member of F intersects each set from some collection and 
then apply (3). We put 


y = |n/3al], m=3a—-1,k= = log og n , © = |loglogn]. 
log n 
Lemma 5. For all s = 0,1,...,m and for i9 = 1 and any iy,...,%5 € 


{1,...,r} there are subfamilies F(1,t1,...,is) of the family F and sets 
X(i1,...,45) and Z(1,i1,...,%s-1) such that for any s = 1,...,m and for 
any i1,...,%45,0, € {1,...,r}, 


(1) F (1s 415042549) C F(1,%1,.--,%s-1); 
(2) For all A€ F(1,i1,...,%5), 


AD X(t) UX (i1, 72) U...U X (a1, 22,-.-, 45); 


(3) The sets X(t1), X(t1,%2),..., X (41, 72,..., 45) are pairwise disjoint; 
(4): |X Cay tay coy ts) |= yp 

(5) |Z(1, 71, 72,.--,%5-1)| < kr(r + 1)/2; 

(6) X (i4, 12, 48 »ts—1, 4s) OX (i1, ta, ties ,ts—1,%,) Cc Z(1, in, 22, is .yts—1); 
(7) \F(A,t1,.--,%s)| > C(n — sy)!€89-",, where 


& = ((20) 78 


n+(n—-y)t+...+(n—=(s—1)y) ) 
m—sy 


a1 
= pears eDy2 acer 
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Proof. We use induction on s. Put F(1) := F, & := 2a. 

Step s (0 <5 <m). We have at hand F(1,i1,...,2,) for any i1,...,%5 € 
{1,...,r} and if s > 0 we also have sets X(i1,...,%s) and Z(1,%1,...,%s—1) 
as needed. Consider 


FSF tic sta) 
= {A\ (X(i1) U X(t, ig) U...U X (41, d2,...,%5))|A € F(1, t1,..., ts) }. 
According to the statements of the lemma, F is a (3,n — sy, 1)-family. Note 
that 


(ns — s(s — 1)y/2)/(n — sy) < ns/(n — (m—l)y) < ——— 


= Se Bay, 
n— (n— 2y) 3a) 
Hence €, < (2)38~g90 and due to Statement 7 of the lemma, we can use 
Lemma 4. This Lemma 4 provides that there exists X, of cardinality y and 


n 


Hy C F with |Hy| > C(n—(s+1)y)!B+DY-” (where 8 = (4€,)*=G*5¥ ) such 
that any A € H, contains X;. We put Z, := 0. Suppose that (3,n — sy, 1)- 
families H1,...,#,; and sets X1,...,X1,Z1,...,Z, are constructed and that 
foreach l<j<l1l<j’<lj#F#j7, 

(i) |Xj| = y; 
(it) |Z] < kU 1)/2: 
iii) XxX; NX} C Li; 

) 

) 


( 
(iv) For any A € H,, X; C A; 


(v) |Hj| > C(n— (8 + L)yeShe. 


Ifl<r then we construct Hj41, X741 and Z}, as follows. Remark that 
for each A € F, we have |A| = n — sy > lk and the number of A € F with 
|AM (X1U...U X7)| > lk does not exceed (by (3)) 


Ai Uos UX 1 
E i lk oI ) on — sy — th = (Pen sy — ilar 


Ik 
eln 7 C(n — sy)! 
_ _ 1 sytlk—n me. 
7 (=) a a S an—suKlk 
But for large n we have 


nlog log n 


Tog 7 
kk > (=) > ec 0-5 log log n > Qe”: 


Hence for the family H’ := {A € F | |AN(X,U...U X))| < Ik} we have 
|H’| > |F| — C(n — sy)!(2€,)8¥-” > C(n — sy)!(2€,)°¥—-". Then by Lemma 4 
there exist Hii, C H’ and Xi41 with |Xi41| = y such that each A € Hi41 
contains Xj41 and |Hi41| > C(n — (s + 1)y)'(B)StD9-”, where 
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may ao\ the 
B= (4x 2€,)"-GtDv = (s Y, ((2ayrgreee-Du/?) ) 


= ((2a)"Brs+))—s(6+1)u/2) CFT — E44, 
By definition of H’, 
|Xi41 1 (X1U...U X7)| < Ik. 


Putting Zj41 := ZU (Xi44 NM (X1 U...U X1)), we have \Zi41| < \Zi| +1k< 
kl(1+1)/2 and conditions (i)—(v) are fulfilled for /+ 1. Thus we can proceed 
till=r. 

After constructing H,., X,. and Z, we put for j=1,...,7r, X (41, i2,-.-, 48,9): 
=X;, and 
Fine iy tind) = {AUX (i1) UX (41, 72) U... UX (in, fa, «45, 9)|A € H;}, 
and 

Z(1,i1,..., 18) = Zp. 


By construction, the statements 1—7 of the lemma will be fulfilled for 
s+l. 


Lemma 6. For all s = 0,1,...,m+1 and for any 4,...,is € {1,...,r} 
there are sets X(i1,...,%5) and Z(1,i1,...,4,-1) and for any %1,...,¢m41 € 
{1,...,r} there are sets A(i1,...,im+41) € F such that 


(1) The sets X(i1), X(i1, 22),..., X (41, i2,.--,%m41) are pairwise disjoint; 

(2) |X(t1, %2,...,4s) | =y ifl<s<m; 

(3) |X (t1, t2,---5¢m+1)| =n—my; 

(4) A(t, ee yim4i1) = X (11) U X (i1, 72) Woda Y X (i1, ta, ee pim4i); for any 
s=1,...,m and for any i1,...,%5,t, € {1,...,r} 

(5) X (t1, 12,.--,%s—1, 4s) OX (41, t2,---, 45-1, 04) Cc Z(1, i1, t2,---,%s-1); 

(6) |Z(1,%1,%2,.--,%s—1)| < kr(r +1)/2. 


Proof. For s = 0,1,...,m and for any 71,...,7, € {1,...,r}, consider 
F(l,i1,..-,%s), X(é1,..-,%5) and Z(1,7%1,...,%s-1) from Lemma 5. 
Now, for an arbitrary (m+ 1)-tuple (1,%1,...,%m), consider 


H =H Lt, xy tm) 
= {A \ (X (41) U X (i, 72) We. X (t1, t2, ieee ,im))|A E F(1,i1, sare ce 
By construction, H is a (3,n — my, 1)-family and by Lemma 5, |H| > C(n — 


my)lEmy—n, 
Recall that m = 3a — 1, and n— my > y > n/(8a) — 1. For large n, 


gay = Bayar hale < (2a)"3"™ < (Qag%-1)\? < kl. 
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Hence |H| > C(n — my)!/k! and by Lemma 3 (note that 0 < r<k < 
(n/3a—1)/2 < (n — my)/2) there exist Ai,..., A, € H such that 


|A, U...U A,| > r(n — my) — kr? /2. (5) 
Let us denote 


Zl 443 2,4) = U Aj M Ah, 
1<j<h<r 


and for 7 =1,...,7, X(i1,72,...,%s,9) = A; and 
A(ii,---; 4m; 9) = X(i1) UX (i, i2) VU... U X (a1, t2,..-, 4m) U Aj. 


In view of (5), |Z(1,%1,-.-,4m)| < kr?/2. Now, by Lemma 5 and the 
construction, all the statements of the lemma are fulfilled. 


Lemma 7. For each A € Ff, there exist s, 0 < 5s < m and i,...,15 € 
{1,...,r} such that 
Af) X (it, i2,.--,48,5) #0 VW SAlavcat (6) 


Proof. Assume that for some B € F for each s, 0 < s < m and each 
i1,..-,t5 € {1,...,r}, there exists j*(1,...,7,) such that 


Bl |X Cig tis.cetad geese) = 0. 
Let further go = 1 and for s=1,...,m+1, 


ds iat: -;Qs—1)- 


Then B has empty intersection with every member of the sequence X(q1), 
X(q1,q2), ---, X(q1,92,---,Qm4i). But this means that B is disjoint from 
A(q1, 92,---;Qm+1), a contradiction to the definition of F. 


Completion of the proof of the theorem. Consider 
A= \|J LJ BGptigaisytesa): 
8=1 (1yit,..s4¢—1) 


Clearly, |Z| < (1 +r+r2?+...4r™)kr(r +1)/2 < kr™?? = kr?7t, 
Let € = {AE F| ANZ FO} and 


H(L,i1,...,48) ={AEF\E| AL) X (iy ia,... 54,9) 40 VIE {1,..., rH}. 


By Lemma 6, for each s, 0 < s < m and %,...,i, € {1,...,r} the sets 
X(i1,72,--.,%s,97) \ Z for distinct 7 are disjoint. Hence by Lemma 7, we can 
write F in the form 

m 


F=eU(U U  Mtsin--nia) 


3=0 (1,%1,...,% 
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Let us estimate 
|E| < |Z]p(n — 1) < kr?8***C(n - lat. 


Note that each A € H(1,%,...,%s) should intersect each of r disjoint sets 
X (ii, t2,.. iibes Ll) \ Z, X (t1, t2,- saybas 2) \ Z,. ..,X (1, t2,. os ,ts,1) \ Z. The 
cardinalities of these sets for s <_m are at most y and for s = m are less than 
2y. Consequently by (3), 


|H(1,t1,.-.,%8)] < (2y)"eln — 7) 


Thus, 


m+l r 
|F| < kr?+0(n — lat” + 7 : (=) Cnla-” 


r—1 3 
_n —a { 1-5(log log n)3>*7 e%a, (log log n)3%e% 
< Cnla~"e : 
logn (1.5 — o(1)) Hes log n] 


But for large n the expression in big parentheses is less than one. So, the 
theorem is proved. 
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Summary. For the most part, this article is a survey of concrete results in extremal 
combinatorics obtained with the method of flag algebras. But our survey is also 
preceded, interleaved and concluded with a few general digressions about the 
method itself. Also, instead of giving a plain and unannotated list of results, we 
try to divide our account into several connected stories that often include historical 
background, motivations and results obtained with the help of methods other than 
flag algebras. 


A Foreword 


When I was asked by the organizers to contribute something on flag algebras, 
I was a bit uncertain at first. The reasons will become clear from the text 
below, but a two-sentence summary is this. In just a few recent years we have 
witnessed a tremendous explosion of activity in this area, and the explosion 
is still ongoing. It does not look (at least to me) quite consistent with the 
inevitable stamp of finality a full-fledged survey is supposed to convey. 

As a consequence, this contribution has a very clear flavor of an 
accounting book. I will try my best to summarize in Sect.3, in a categorized 
and annotated form, concrete results in extremal combinatorics obtained with 
the method of flag algebras so far. Or, in other words, where do we stand 
now, in February of 2013. 

That said, I still feel obliged to say at least a few general words about the 
method itself, and this is where we begin. This introductory part is rather 
loose and informal, and a disinterested reader may proceed directly to Sect. 2. 


* Part of this work was done while the author was at Steklov Mathematical 
Institute, supported by the Russian Foundation for Basic Research, and at Toyota 
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1. The Method 


The theory of flag algebras is supposed to treat in an entirely uniform way 
all classes of combinatorial structures C that possess the hereditary property: 
any subset of vertices of a structure from C gives rise to another (“induced”) 
structure in C. A precise definition at the appropriate level of generality is 
best given in logical terms [63, §2], but for the purposes of this text we 
can safely assume that C is the class of either ordinary simple graphs or r- 
uniform hypergraphs (r-graphs) or oriented graphs (orgraphs). In this section 
the specific choice of the class C is almost never important, and for simplicity 
we will use the word “graph” cumulatively. 

The main quantity studied in the part of extremal combinatorics that is 
amenable to the method of flag algebras is the number i(H,G) of induced 
copies (up to automorphisms of H) of a graph H in a larger graph G. One 
of the most basic paradigms underlying the theory of flag algebras tells us to 
normalize whenever possible so we immediately replace these numbers with 
the corresponding densities and let 


-1 
pHa) (F) aaa) £2 ve), ¢% VED). 
One useful interpretation is that p(H,G) is the probability that a randomly 
chosen ¢-subset of V(G) induces a subgraph isomorphic to H [63, §2.1]. 

In many contexts, notably in the theory of graph limits, researchers are 
often interested in the number of all copies, not necessarily induced, and 
sometimes also other variants. It turns out, however, that all these variants 
are essentially equivalent; let us review some simple formulas connecting 
different versions (see [50, Chap. 5.2.2]) as we will occasionally need them 
below. 

Let ind(H,G) be the number of induced embeddings a : V(H) —> 
V(G), that is embeddings preserving both adjacency and non-adjacency. 
Denoting by 


ind(H, G) 
L(L—1)---(L-£+1) 


tind (H, G) e 


the corresponding density, we see that ind(H,G) = |Aut(H)|- i(H,G) and, 
hence, 


|Aut(H)| 
el 
tinj(H, G) is defined similarly to ting(H,G) with the difference that now the 


embedding @ need not necessarily be induced, i.e. it must respect adjacencies 
only. Clearly, 


tind (A, G) = (A, G). 
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tg( HG) = So tnalH',G) = 5 So Aut IPH), (1) 


H'DH H'OH 


and the inverse formula is given by the Moébius transform (see [50, (5.20)]): 


tina(H,G) = S> (-1)I2@OI-IFGDI4 (HG). (2) 
H’DH 


Two more variants, homomorphism density t(H, G) [50] and strong homo- 
morphism density [40, Sect.2.3] are obtained from  tinj(H,G), 
tind(H, G), respectively, by dropping the requirement that the mapping a 
must be injective, followed by an obvious re-normalization. They are related 
to each other via formulas completely analogous to (1), (2). There is no neat 
formula, however, relating “injective” densities p(H, G), tina(H, G), tinj(H, G) 
with their non-injective versions: any such formula must necessarily involve 
the number of vertices L, which is grossly inconsistent with the philosophy 
of flag algebras. What is important, however, is that as L — oo, the 
difference between these two classes of measures becomes negligible (see e.g. 
[50, (5.21)]). 

A significant part of extremal combinatorics studies arithmetic and 
Boolean relations existing between the densities p(H1,G),...,p(Hn,G) 
(or sometimes their equivalent versions ting(Hi, G), tinj(Hi,G)) where Hi,..., 
Hy, are small fixed templates, and G is an unknown graph. Sometimes 
problems of interest (like the Caccetta-Haggkvist conjecture that we will 
discuss in Sect. 3.3) also involve concepts like minimal/maximal degree; these 
fit into our framework with very minimal changes. 

And the asymptotic extremal combinatorics additionally assumes that 
the size of G is very large, and thus these relations are to be satisfied only 
in the limit. More precisely, in every increasing sequence G},G2,...Gn,... 
of graphs, we can by compactness choose a subsequence G4,...,Gi,,... 
such that all h limits lim, p(Hy,Gi,) (Vv € [h]) exist; denote them 
by ¢(H1),...,¢(Hn). The question is then re-phrased as follows: which 
properties should the tuple (¢(H1),...,¢(Hn)) satisfy? 

The next observation is that by going to an infinite subsequence we can 


ensure that the limits ¢(H) “ limp... p(H,G’,) exist for all (countably 
many) graphs H, not only those we are actually interested in. This follows 
from Tychonoff’s theorem on the compactness of products of compact 
sets (that in our particular case can be replaced by a simple diagonal 
argument). Such sequences are called convergent, and the function ¢ that 
maps isomorphism classes of finite graphs! is a paradigmical example of what 
in the theory of graph limits is called a simple graph parameter [50, Chap. 4.1]. 


' It is perhaps a good time to remind that in this section we use the word “graph” 
in a broader sense that also includes hypergraphs, orgraphs, etc. 
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Convergent sequences of graphs {G;} and associated graph parameters ¢ 
make the main object of study in both theories: graph limits and flag algebras. 
From this point, however, they diverge significantly: a logician might have said 
that the theory of graph limits is semantical in its nature while flag algebras 
strongly focus on syntax. Indeed, a very substantial part of the theory of 
graph limits deals with the question of what is the actual limit object for a 
converging sequence of graphs and with studying its properties. This limit 
object was successfully described by Lovadsz and B. Szegedy for ordinary 
graphs (graphons, see [50, Chap. 7]), by Elek and B. Szegedy for hypergraphs 
[50, Chap. 23.3], and it looks as if a sort of a description is possible even for 
directed graphs [50, Chap. 23.5]. 

The approach taken by flag algebras is on the contrary manifestly 
minimalistic, which is dictated by the utilitarian purpose of the theory. 
Semantics is substantially demoted as not being very useful for proving 
new concrete results; one immediate advantage of this is that the theory 
can be applied to arbitrary combinatorial structures without any changes 
at all. Instead, it focuses on developing syntactic tools for proving universal 
statements about the quantities ¢(H,),...,¢(H»,) using more or less formal 
manipulations. Careful attention is paid to notational uniformity, simplicity 
and transparency: this is particularly important since, as the experience 
shows, the method begins to bring real fruit dangerously close to the 
region where it becomes unfeasible for purely computational reasons, see 
the discussion in [34, Sect. 4.1]. Another characteristic feature of the method 
is its strong tendency to expose and exploit (usually simple) mathematical 
structure in an uniform way wherever it can be found. Besides obvious 
mathematical connections, this paradigm, somewhat surprisingly, has its 
own non-negligible utilitarian value. For example, it adds versatility to 
some existing packages for working with flag algebras that can be easily 
re-programmed to work with different types of combinatorial objects. 

As we indicated at the beginning, this text is not intended to be an 
exposition of the method itself. Almost all necessary formalism can be found 
in the original paper [63, 67, Sect. 2.1.1] adds half a page of notation and 
definitions that are particularly useful when one has to work with several 
different types of combinatorial structures at once. An informal account 
can be found in [45, Sect.7], and almost every paper with concrete results 
surveyed below also strives to explain its own version of the formalism in its 
own way. But in the next Sect. 3 we will use the distinction between “plain” 
(Cauchy—Schwarz) applications and those using more advanced concepts. So 
we conclude with a somewhat informal account of the fragment of the general 
theory that is necessary to understand this distinction. This part is similar 
to quantum graphs, graph algebras, reflection positivity, etc., studied in the 
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context of graph limits [50, Part 2], but there are also important differences 
dictated, as almost everything else in flag algebras, by pragmatic purposes. 

Let M be the set of all finite graphs up to an isomorphism, and RM be 
the set of all their finite formal linear combinations with real coefficients. 
Then any graph parameter ¢ can be extended by linearity to a linear 
mapping RM — R that we will also denote by the same letter ¢. Graph 
parameters ¢ resulting from convergent sequences of graphs turn out to 
satisfy ¢(f) = 0 for certain elements f € RM expressing the most basic chain 
rule [63, Lemma 2.2]. Factoring out by these relations, we obtain a linear 
space that is denoted by A° (the meaning of the superscript 0 will become 
clear soon). It turns out that for every pair H,, Hz of graphs, ¢(H1)¢(H2) 
can be always expressed as $(f) for an easily computable element f € A° 
not depending on ¢. This allows us to endow A° with the structure of an 
associative commutative algebra [63, Lemma 2.4], and thus ¢ defines an 
algebra homomorphism from this algebra to the reals. It clearly satisfies 
¢(H) > 0 for any graph H, and we let Hom*(A°,R) denote the set of all 
algebra homomorphisms with the latter property. 

One extremely important fact is that at this point our search for 
“weneric”, “logical” relations satisfied by all graph parameters resulting 
from convergent sequences is over. Namely, the “completeness theorem” 
[50, Theorem 11.52], [63, Theorem 3.3] states that every element ¢ € 
Homt(A°,R) can be realized as a convergent sequence of graphs, and this 
allows us to focus on Hom*(A°, R) as an axiomatic description of our main 
object of study. Of course, even under this view, the intended semantical 
interpretation is still indispensable for intuition and is occasionally used in 
arguments (see e.g. [63, Theorem 4.3]). 

The backbone of the theory is made by the real cone C2.,, consisting 
of all those f for which ¥@ € Homt(A°,R)(¢(f) > 0), and what we 
refer to as “plain” Cauchy—Schwarz applications is just a systematic way 
of finding “interesting” elements in this cone by semi-definite programming. 
More specifically, all notions reviewed so far readily generalize to the relative 
framework in which a prescribed number of vertices k spanning a prescribed 
graph o are labeled in all objects under consideration and are always required 
to be preserved [63, §2.1]. o itself is called a type [of size k], relativized graphs 
become flags [of type o], and the relativized version A’ is (finally!) called the 
flag algebra. For every f € A? we clearly have f? € CZ,,,, and we also have a 
naturally defined averaging (or label-erasing) linear operator |-], : A7 —+ A°® 
preserving the set of positive elements: [C2..,Jo C Co. [63, Theorem 3.1]. 


sem 
This already provides us with a supply of non-trivial elements in C2... of the 
form [f?]. (f € A®%), and we can also take their linear combinations with 
non-negative coefficients. The resulting set is a quadratic sub-cone C° C C2. 
defined by positive semi-definite constraints. And when the size of all flags 
involved is bounded by a constant ¢ (in a “typical” plain application of the 
method ¢ varies between 4 and 6), the corresponding SDPs become finitely 


defined, and, what is even more important can be handled by the existing 
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solvers? sufficiently well to actually solve problems. This is what we will refer 
to as the “plain” method, and in what follows we will use the word “plain” 
in this rather technical and exact sense. 

The structure that can be extracted from the objects Hom (A°,R) 
is, however, much richer than that and includes other things like various 
algebra homomorphisms allowing us to move true statements around [63, 
§2.3], ensembles of random homomorphisms extending a given one [63, §3.2] 
or variational principles [63, §4.3]. We can not go into details here, but in 
Sect.3 we will sometimes mention these structures by name whenever they 
are used in the arguments. 

What are the relations between the cone C2,,, we are interested in and 
its approximation C° corresponding to what we can prove using Cauchy— 
Schwarz arguments? Topologically, C° is dense in C2,,,, and one does not even 
have to use quadratic relations for that. Namely, it is a simple consequence 
of the completeness result [50, Theorem 11.52], [63, Theorem 3.3] that the 
linear subcone in C° consisting of non-negative linear combinations of flags 
is already dense in C2... 

In terms of logical complexity, however, the difference is huge. If we, 
for simplicity, focus on rational points in these cones, then the sub-cone 
C? C C® consisting of all inequalities provable by using only ¢-sized flags is 
decidable and hence C° = U, C? is recursively enumerable. The fundamental 
result by H. Hatami and Norin [44] states that C2, is not r.e. already for 
ordinary simple graphs. Informally, this means that every proof system that 
will try to generate true statements in the asymptotic extremal combinatorics 
will necessarily be incomplete. Very recently, Lovett, H. Hatami, P. Hatami 
and Norin have extended this result to the theory of 2-colored graphs with 
distinguishable parts. 

Finally, the theory of flag algebras has not appeared overnight out of 
nowhere, it had many predecessors. First of all, most constructions and 
arguments are modeled after their discrete counterparts that have been used 
in extremal combinatorics for many decades. Next, one should definitely 
mention the method of Lagrangians [55] that was perhaps the first successful 
usage of analytical methods in the area. Quasi-random graphs [16] are, in our 
language, devoted to the study of one specific and, arguably, the most natural 
element of Hom*(A°,R), and many central results and proofs there have a 
distinct syntactic flavor. Bondy [9] used what we would now call “Cauchy— 
Schwarz calculus” in the specific context of the Caccetta—Haggkvist problem. 


? In my own work, I interchangeably use CSDP [10] and SDPA http://sdpa. 
sourceforge.net/, and my special thanks go to their developers. 
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2. Notation 


We review the main definitions for the case of simple r-uniform hypergraphs 
(r-graphs in what follows), where r > 2 is a fixed number. We will also be 
occasionally considering oriented graphs,’ but it is very straightforward how 
to adopt our definitions to that case. 


2.1. Turan Densities 


For two r-graphs F and G, G is F’-free if it does not contain (not necessarily 
induced) subgraphs isomorphic to F’. Given a family F of r-graphs, G is 
F-free if it is F-free for every F € F. Let exy(n;F) be the maximal 
possible number of induced copies of an r-graph H in an F-free r-graph 
on n vertices and 


exy(n; F) 


tH(F) = lim 
ae Gvin) 


n> co 


In the language of flag algebras, 7 (/) is the maximal possible value of ¢(H), 
where the maximum is taken over all ¢ € Homt(A°,R) for which ¢(F) = 0 
whenever F' contains a spanning subgraph isomorphic to some F' € F. We let 


def 
™(F) = me}(F), 
where e is a single (hyper)edge. For better understanding the context of 
this survey, it is useful to recall that in the case of ordinary simple graphs 
the quantities 7(F) are completely described by the Erdés—-Stone-Simonovits 
theorem [26, 27]: 


(3) 


where r “2 min {x(G) | Ge F}. 
In order to cover more situations of interest, we define expin,a(n;F), 
Tmin,H(F) analogously to 7(F), but with the following two differences: 


1. We are interested in the minimal possible number of induced copies of H; 
2. r-graphs from ¥ are forbidden only as induced subgraphs. 


Again, when H is a single (hyper)edge, exmin (7; F), Tmin, a (F) are abbrevi- 
ated to €Xmin(; F), Tmin(F). Very recently, Norin (personal communication) 
was able to give a nice and complete description of tin(F) for the case 


3 That is, directed graphs without loops, parallel or anti-parallel edges. By 
analogy with the abbreviation “digraph”, in this survey oriented graphs will be 
often called orgraphs. 
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of ordinary graphs. More generally, given a finite set F of graphs he fully 
describes the set D(F) C [0,1] consisting of those x € [0,1] for which there 
exists ¢ € Homt(A°, R) with ¢(F) =0 (F € F) and ¢(e) = x. The situation 
for 3-graphs is very different, and some related results will be thoroughly 
discussed in Sect. 3.4. 

When F = {F} consists of a single graph, the quantities 7_(F) and 
Tmin,H(F’) can be readily generalized to their relaxed versions when instead of 
forbidding copies of F' entirely, we are interested in minimizing their number. 
For example, given x € [0,1], we let 


def ex a(n; F 
ofl) Stim Se) (4) 
neo (iy (my) 


where exy7,,(n; F) is the minimal possible density of copies of F' in an r-graph 

on n vertices, in which the density of (induced) copies of H is at least x. 

Thus, gf; is a non-decreasing function and 74(F) is the maximal x for which 
F 

Ge (x) = 0. 


2.2. Frequently Used [or] Graphs 


Ke is a clique on £ vertices, Jy is an independent set on @ vertices, Ce [Cz] 
is a non-oriented [oriented, respectively] cycle of length @, and P [P,] is a 
non-oriented [oriented] path on ¢ vertices, i.e., of length (¢— 1). Ki,¢ is the 
oriented star on (€+1) vertices in which all edges are oriented from the center. 


2.3. Frequently Used Hypergraphs 


kj is a complete r-graph on ¢ vertices, and J} is an empty r-graph on ¢ 
vertices (thus, Ke = K? and Ip = J7). Jy, is the 3-graph on (k + 1) vertices 
consisting of all ($) edges that contain a distinguished vertex v. Gy is the 
uniquely defined 3-graph on 4 vertices with ¢ edges; thus, G, = K?, and G3 
is often denoted by Ky. Cs is the 3-graph on 5 vertices with the edge set 
{ (123), (234), (345), (451), (512)}. F3 9 is the 3-graph, also on 5 vertices, with 
the edge set {(123), (145), (245), (345) }. 

F is the edge-complement of a (hyper)graph F (on the same set of 
vertices). For a (hyper)graph F, X(F) is its Lagrangian defined as the 
maximal possible edge density of all weighted hypergraphs resulted from 
placing probability distributions on the vertices of F’. 


3. Results 


In our survey of existing results, we are trying to group them into a few 
large groups centered either around a “big” problem or a reasonably broad 
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topic. In all these cases the contribution made by flag algebras has been 
very substantial, but seldom it was exclusive. Therefore, we feel that our 
purpose will be served better if we give more coherent account by including, 
whenever appropriate, historical context, motivations, results proved by other 
methods, etc. 


3.1. Clique Densities 


In this section we consider only simple ordinary graphs, and we are interested 
in the functions Oe (see (4)). The case p = 2 has received most attention, 
and we abbreviate 


Q 
h 


gr(x) 8 g(x. 
In words, g;(x) is the (asymptotically) minimal possible density of K;, in 
graphs with edge density > x. 

The first general bound on g,(x) was proved by Goodman [35]: 


g3(a) > «(2a — 1); (5) 


in the framework of flag algebras his proof amounts to a one-line calculation 
(63, Example 11]. This result was later rediscovered by Nordhaus and Stewart 
[58] who also conjectured that 


g3(x) 2 52a — 1). (6) 


wl rw 


Goodman’s bound (5) was extended to the case r = 4 by Moon and Moser 
[52] as follows: 


ga(x) > «(2x — 1)(3x — 2) (a > 2/3). 


Following the pattern, they also stated without proof the natural generaliza- 
tion 


wo) > [lee 0) («21-—) (7 


r— 


for an arbitrary r; a complete proof was later provided in [48, 51]. 
Values of the form x = 1 — + are called critical. These are precisely edge 
densities of complete balanced t-partite graphs, and at critical values the 
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right-hand side of (7) computes the densities of K, in these graphs. Thus, 
the bound (7) (and its partial case (5)) is tight at the critical points 1 — $; 
the question is what is happening between them. 

The bound (7) is convex. Let w,(a) be the piecewise linear function 


that is linear in every interval }1—7,1—4y| and coincides with g, at 


its ends. Then, by convexity, w-(a) > g,(x) (note that in the interval 
[1/2, 2/3] the bound conjectured in (6) is precisely 73(x)). More generally, 
let w(x) be the piecewise linear function that is linear in every interval 


EB (1 = +) +9p (1 = 4) and coincides with g, (1 = +) Or (1 = 4) at its 
ends. 

In the beautiful paper [8], Bollobds proved that ~?(x) still provides a 
lower bound on the function Ge 


94 (x) > PP(a). (8) 


A brief survey of these and related early developments can be found in [7]. 

We are not aware of any improvements on Bollobas’s bound (8) for p > 2, 
which, in our opinion, makes an interesting open problem. The follow-up 
research concentrated on computing the functions g,(x). 

As for upper bounds, let us consider a complete (¢ + 1)-partite graph 
in which t parts are of the same size while the remaining part is smaller. 
aap 
this class with the edge density x. Computing the density of K,. in it leads 
to the following (somewhat ugly) upper bound on g,(z): 


there exists an asymptotically unique graph in 


Given x 1 + 1 


Gr(@) S oC” (t (r— 1) Vt(t— a(t + 1) 


(t+ Vié-2@4+D)) (we [1-4,1-4)). 


as 
t+ |° 


I was not able to trace the origin of the conjecture that the bound (9) 
is actually tight, but in explicit form it appears already in the paper [51] 
by Lovasz and Simonovits. The same paper proved the conjecture in some 
sub-intervals of the form [1 _ , 1- + + Er,t] where €,, is a (very small) 
constant. The next development occurred in 1989 when Fisher [30] proved* 
that (9) is tight for r= 3,t = 2. 

And this is where flag algebras entered the stage. Firstly, Razborov [63, 
§5] independently re-discovered Fisher’s result. More generally, a relatively 


This bound is concave in the interval [1 - +, 1- 


4 Fisher’s proof was incomplete as it implicitly used a fact about clique 
polynomials unknown at the time. This missing statement, however, was verified 
in 2000. 
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simple calculation [64, (3.6)] shows that if the bound (9) is tight for some t 
and r = 4, then it is also tight for the same value of t and r = 3. Fisher’s 
result follows immediately since (9) is tight when t = 2,r = 4 (both sides are 
Zero). 

Then, using much more involved flag-algebraic constructs and calcula- 
tions, Razborov [64] proved that the bound (9) is tight for r = 3 and an 
arbitrary t. In the classification scheme outlined in Sect.1, this proof is 
certainly not plain, and in fact it barely uses Cauchy—Schwarz at all. Instead, 
it significantly employs more elaborated parts of the theory like ensembles of 
random homomorphisms or variational principles; we can not go into further 
details here. 

While the next two papers do not directly use the language of flag algebras 
(see, however, the discussion at the conclusion of Sect. 1 in [69]), the proofs 
are still highly analytical. Nikiforov [57] proved that (9) is tight for r = 4 (and 
any t). And, finally, Reiher [69] established the same for arbitrary r,t thus 
completing the quest for computing the function g,. itself. Let me, however, 
remind here again that no progress on the relative values In for p > 2 has 
apparently been made since Bollobds’s seminal paper [8]. 

As for exact bounds, infinite blow-ups in general provide a powerful tool 
for converting asymptotic results into exact ones. In our context (we will 
discuss one more case in Sect.3.5) this simple idea immediately implies the 
bound 


ca 

eXe.2(n; K,) > “5 (=) (10) 
[64, Theorem 4.1]. Nikiforov [57, Theorem 1.3] showed that it is rather close 
to optimal. 

Lovasz and Simonovits [51] made several quite precise conjectures about 
the behavior of exe,,(n; K;) and the corresponding extremal configurations, 
but these conjectures still remain unanswered. A partial progress toward them 
was made by Pikhurko and Razborov in [62]. Firstly, using a genuine flag- 
algebraic argument, they completely described the set 6 C Hom*(A°,R) of 
all asymptotically extremal configurations, i.e., those ¢ for which ¢(i3) = 
g3(¢(K2)). Then, by standard techniques, [62] proved stability, i.e., that 
actual extremal configurations are o(n”)-close to the conjectured ones in the 
edit distance. These are precisely the first two steps of the program that we 
will discuss in the next Sect. 3.2. The third step, however (extracting an exact 
result from the stability version) is still missing. And for r > 3 nothing along 
these lines seems to be known at all. 

In conclusion, let me note again that since [8] and [51] all improvements 
have been very analytical in their nature. Proving comparable results with 
entirely combinatorial techniques remains an unanswered challenge. 
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3.2. Turadn’s Tetrahedron Problem 


In this section we switch gears and work with 3-graphs. The value 7(47) is 
unknown for any pair @ > r > 2, this is the famous Turdn problem. More 
information on its history and state of the art can be found in the recent 
comprehensive survey [45] (see also much older but still useful text [71]). In 
this section we concentrate on the simplest case r = 3, € = 4, with a brief 
digression to the next one, r = 3, € = 5. m(K3) = 1 — mmin(J3), and it will 
be convenient to us (partially for historical reasons) to switch to this dual 
notation. Turan [74] conjectured that tmin(I?) = 4/9, and this conjecture is 
sometimes called Turdn’s (3,4)-problem or tetrahedron problem. De Caen [14], 
Giraud (unpublished) and Chung and Lu [18] proved increasingly stronger 
lower bounds on Tmin(J?), the latter being of the form tmin(Z3) > evi > 
0.406407. 

A plain (remember that we use this word in a technical sense) flag- 
algebraic calculation leads to the numerical bound 


Tmin (Ij) > 0.438334 (11) 


[65] that was verified in [12] and later in [34] using the flagmatic software 
(we will discuss the latter in Sect. 4.1). The scale of this improvement reflects 
a general phenomenon: let me cautiously suggest that I am not aware of a 
single example of a non-exact bound in asymptotic extremal combinatorics 
that could not be improved by a plain application of flag algebras. 

The remaining results in this section were distinctly motivated by the 
structure of known extremal configurations elaborated in a series of early 
papers by Turan [74], Brown [11], Kostochka [49] and Fon-der-Flaass [28], and 
we review it first. Our description (borrowed from [67]) has a rather distinct 
analytical flavor; for more combinatorial treatment see, e.g., [45, Sect. 7]. 

Let [ be a (possibly infinite) orgraph without induced copies of Cy. 
Let F DF(L) be the 3-graph on V(L) in which (u,v, w) spans an edge if and 
only if |¢u,0,} contains either an isolated vertex (i.e., a vertex of both in- 
degree and out-degree 0) or contains a vertex of out-degree 2. Then FDF(T) 
does not contain induced copies of [? [28]. 

Next, let Q © Zs x IR, and consider the (infinite) orgraph Tx = (Q, Ex) 
given by 

Ex © {((a,2z),(b,y)) | (ety <0Ab=atl1)V(xt+y>0Ab=a-1)}. 
Tx does not have induced copies of C4 and hence F DF(T) does not contain 
induced copies of 3. The set of known extremal configurations when the 
number of vertices is divisible by three is precisely the set of all induced 
subgraphs of this 3-graph that are of the form FDF(Ux|z,x5), where S is 
an arbitrary finite set of reals. 

Turan’s original configuration [74] corresponds to the case when S C Rt. 
Brown’s examples are obtained when negative entries in S are allowed, but are 
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always smaller in absolute values than positive entries. Kostochka’s examples 
[49] correspond to arbitrary finite S. And if we replace [the uniform measure 
on] S by a non-atomic measure on the real line, we will get a full description 
of all known ¢ € Hom*(A°,R) with d(1?) = 0 and ¢(e) = 4/9. 

Turan’s original example does not contain induced copies of G3 which 

implies Tmin(I?,G3) < ¢. Razborov [65] proved that in fact 
3 4 
Tmin (Ly ; G3) = 9 (12) 
which also was the first application of the method in its genuinely plain form. 
Baber and Talbot [13, Theorem 25] gave a list of ten 3-graphs {Hi,..., Hio} 
on six vertices for which non-induced results of the same nature hold: 
n(K3, Hj) = 2 (1 <i < 10); their proof method is also plain. 

Pikhurko [59] proved that for a sufficiently large n, Turdn’s example is the 
only 3-graph on which exmin(n; [?, G3) is attained. This was also one of the 
first papers to demonstrate the three-step program for converting asymptotic 
flag-algebraic results into exact ones: 


1. Describe the set of all extremal elements in Homt(A°,R) (which in this 
particular case consists of a single element). 

2. Prove stability, that is that the convergence in the pointwise topology 
described in Sect. 1 can be strengthened to convergence in the edit distance. 

3. Move from stability to exact results using combinatorial techniques. 


Let us now take a brief de-tour and discuss a couple results of similar 


nature inspired by the next case r = 3, € = 5 in Turan’s problem. 
The situation with computing ™pin(J?) itself is very similar to tmin(J}): 
Turdn’s conjecture says that tmin(I3) = 1/4, and there are many non- 


isomorphic configurations realizing this bound. The simplest of them given 
by Turan himself is the disjoint union K? p U K3 j2 of two cliques of the 
same size. Let H,, Hz be the two non-isomorphic 3-graphs on 5 vertices with 
precisely two edges. Then they are missing in Turdn’s example above, and 
Falgas—Ravry and Vaughan proved in [34] that 


Teale Ay, Hz) = 1/4 


which is analogous to (12). Their proof method is plain. 

To review another remarkable result, it is convenient to switch to the 
dual notation. Turan’s construction from the previous paragraph implies that 
m(K3) > 3/4 and, more generally, 7(G) > 3/4 for any 3-graph G that is not 
2-colorable. In particular, this applies to critical (that is, edge minimal) 3- 
graphs on six vertices with chromatic number 3. There are precisely six such 
graphs; one of them being K3 plus an isolated vertex (in other words, Js) 
and, obviously, m(K3) = 7(Js). Quite remarkably, using flag algebras, Baber 
[1, Theorem 2.4.1] proved that 7(G) = 3/4 for every one of the remaining 
five graphs on the list; his proof is plain. 
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We now return to the tetrahedron problem. Clearly, not all graphs without 
copies of I? can be realized in the form FDF(I), and Fon-der-Flaass [28] 
asked whether Turan’s conjecture can at least be proved for 3-graphs of his 
special form. He himself showed a lower bound of 3/7 (superseded by (11)). 
While the Fon-der-Flaass conjecture is still open, Razborov [67] verified it 
under either one of the two following assumptions: 


1. Tis an orientation of a complete t-partite graph (not necessarily balanced) 
for some t; 
2. The edge density of T is > 2/3 — € for some absolute constant e. 


Note that (2) settles a local version of the Fon-der-Flaass conjecture, that 
is proves it in an open neighborhood of the set ® C Homt(A°,R) of known 
extremal configurations. The proof method is a combination of plain and more 
sophisticated techniques heavily based upon working with several different 
kinds of combinatorial structures at once and frequently transferring auxiliary 
results from one context to another. The author expresses his hope that 
this kind of interaction (mostly human reasoning aided in appropriate places 
by the hammer-like power of plain flag-algebraic arguments) will become 
increasingly more popular in the area. 

The result (12) is relevant only to the original extremal example given 
by Turan as all others contain plenty of induced copies of G3. Razborov [68] 
identified three 3-graphs on 5 vertices given by their set of edges as follows: 


E(H,) © {(123) (124) (134) (234) (125)(345)} 


E(Hz) © {(123) (124) (134) (234) (135)(145) (235)(245)} 


E(Hs) © {(123) (124) (134) (234) (125) (135) (145) (235)(245)} 


and proved that 
Tmin (12, Hi, H2, H3) = 4/9. (13) 


The motivation behind this result is that, as induced subgraphs, Hy, H2, H3 
are missing in FDF (Ix) and, thus, in all known extremal configurations. 
Flag algebras are used in this proof only “behind the scene”, but the proof 
method itself deserves a few words here. Let us call a 3-graph H singular if 
its edge set is not a superset of E(F DF(T)) for any orgraph T which is an 
orientation of a complete t-partite graph (cf. the first result from [67] cited 
in item (1) on page 220) and does not contain induced copies of C4. Then 
[68] proved that 


ttH(H), Ho, H3) =0, 


where H is an arbitrary singular 3-graph and 774(F) is defined similarly to 
tH(F), with the difference that only induced copies of elements in F are 
forbidden. The proof uses Ramsey theory, and the main result (13) follows 
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almost immediately from this and the first result in [67]. We are not aware of 
a similar “zero-inducibility” phenomenon that would not have held for some 
trivial reasons. 

In conclusion, [67, 68] provide several results that verify Turdn’s conjec- 
ture for several natural classes containing the set ®puran of all known extremal 
examples. None of them, however, covers an open neighborhood of ®quran, 
and we believe that obtaining such a local result would have been a major 
step toward resolving the unrestricted version of the tetrahedron problem. 


3.3. Caccetta—-Haggkvist Conjecture 


In this section we work with oriented graphs. 

In 1970, Behzad, Chartrand and Wall [4] asked the following question: 
if G is a bi-regular orgraph on n vertices of girth (€+ 1) (ie., Cy-free for 
any k < @), how large can be its degree? They conjectured that the answer 
is [24 and presented a simple construction attaining this bound. Eight 
years later, Caccetta and Haggkvist [17] proposed to lift in this conjecture 
the restriction of bi-regularity and, moreover, restrict attention to minimal 
outdegree only. In other words, they asked if every orgraph without oriented 
cycles of length < @ must contain a vertex of out-degree < not and it is this 
question that became known as the Caccetta—Hdaggkvist conjecture. Like in the 
previous Sect. 3.2 we concentrate on the case ¢ = 3 even if some prominent 
work has been done for higher values of @. 

Let c be the minimal x for which every C3-free orgraph on n vertices 
contains a vertex of outdegree < (c + o(1))n; the Caccetta—Haggkvist 
conjecture then says? that c = 1/3. Caccetta and Haggkvist themselves 
proved the bound c < 3-v5 = 0.382 [17]. In the paper [9] that, as we 
acknowledged in Sect. 1, was one of the predecessors of flag algebras, Bondy 
proved that c < 2v6—3 < 0.379. His proof is essentially what we would call 
here a plain application of the method using C3-free orgraphs on 4 vertices 
(there are 32 of them). However, instead of actually solving the resulting 
SDP, Bondy gives a hand-manufactured (non-optimal) solution to it. Shen 
[70] improved this to c < 0.3543, and Hamburger, Haxell and Kostochka [39] 
proved a bound of c < 0.3532. 

The current record of c < 0.3465 was established by Hladky, Kral’ and 
Norin in [43] using flag algebras. After incorporating an inductive argument 
previously used by Shen in [70], their proof method is mostly plain, but it 
also introduces one more novel and important feature. Namely, [43] utilizes 
a result by Chudnovsky, Seymour and Sullivan [19] on eliminating cycles 
in triangle-free digraphs that is only somewhat related to the Caccetta-— 
Haggkvist conjecture, and adding that auxiliary result to the computational 


° It is well-known that its asymptotic and exact versions are equivalent. 
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TWISTED CIRCLE IN-PENDANT OutT-PENDANT 


Fig. 1 Forbidden orgraphs. 


brew is paramount for the improvement. Again, I would like to express my 
hope that in the future we will see more examples of interaction of this sort 
between different problems. 

As for partial but exact results, Razborov [66] proved the Caccetta— 
Haggkvist conjecture under the additional assumption that the three or- 
graphs on Fig.1 are missing as induced subgraphs. Like in the previous 
section, the point here is that these orgraphs are missing in all known 
extremal configurations; for the description of the latter see [9, Sect. 3] and 
[66, Sect. 2]. The proof is not plain and in fact does not use Cauchy—Schwarz 
at all. Moreover, all concrete calculations are so simple that the proof was 
presented in an entirely finite setting but using flag-algebraic notation. 


3.4. Topics in Hypergraphs Motivated 
by the Erd6és—Stone—Simonovits Theorem 


From this point on, all flag-algebraic proofs we review are plain. Therefore, 
we will normally omit this qualification. 

As we already noted in Sect.2, the Erdés—Stone-Simonovits theorem 
(3) completely settles the question of computing 7(F) for finite families 
of ordinary graphs. But it also implies several interesting structural con- 
sequences pertaining to the behavior of this function. In this section we 
survey a few contributions to the hypergraph theory bound together by the 
general intention to understand how precisely badly this theorem fails for 
hypergraphs. 

To start with, (3) implies that 7(F) = mingers7(F) (one direction is 
obvious), and by analogy with objects like principal ideals, etc. it is natural 
to say that for ordinary graphs the function 7(F) displays principle behavior. 
It is also natural to ask if this is true for hypergraphs, and, indeed, Mubayi 
and Rédl [54] conjectured that non-principal families F (i.e., those for which 
m(F) < minger m(F)) exist already for 3-graphs. They further conjectured 
that they exist even with |F| = 2. 

The first question was answered in affirmative by Balogh [2], but, in 
his own words, “the cardinality of the set F is not immediately obvious”. 
The second question was answered by Mubayi and Pikhurko [53] who showed 
that the pair (K?, Js) is not principal. 
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From the discussion in [54] it is sort of clear that the authors expect 
the pair (G3,Cs) to be non-principal, and to that end they note the known 
inequality [29] 


(G3) 2 (14) 


z 

7 9 

as well as prove new results 7(Cs) > 0.464 and 7(G3,Cs) < 2. 
Using flag algebras, Razborov [65] improved the latter bound to 


1(G3,C5) < 0.2546 < : 


thus proving that (G3,Cs5) is indeed a non-principal pair. Then Falgas—-Ravry 
and Vaughan [34], also using flag algebras, proved that the pairs (Gs, F3,2) 
and (K#, J,) are non-principal. The former example is remarkable since they 
were also able to compute 


™(G3, F3,2) = a 
and 77(F3.2) = 4 had been known before [31] (for a several-line flag-algebraic 
proof of this result see [65, Theorem 5]). Nonetheless, 7(G3) is still unknown, 
and it is interesting to note in this respect that we still do not know of any 
example of a non-principal family F for which we can actually compute all 
involved quantities 7(F) and a(F’) (F' € F). 

Another obvious consequence of the Erdés—Stone-Simonovits theorem is 
that for ordinary graphs, 7(¥) is always rational. The book [15] mentions 
the conjecture, believed to be due to Erddés, that this will also be the case 
for r-graphs. This conjecture was disproved using flag algebras by Baber and 
Talbot [13] who gave a family of three 3-graphs F such that 7(F) = A(F3,2) = 


189-1516 | It was also independently disproved by Pikhurko using different 
methods [61], but his family F is huge. 

Yet another consequence of the Erdés—Stone—Simonovits theorem is that 
in case of ordinary graphs, for any a € [0,1) the density bound 7(F) < a 
can be forced by a finite family F such that all graphs G € F have larger 
density > @ for some fixed 6 > a. Moreover, the graphs G € F can 
be made arbitrarily large, and (this is important!) 6 does not depend on 
minges |V(G)|. For example, if a € [1/2, 2/3), then this property is witnessed 
by taking 6 = 2/3 and letting F consist of a single balanced complete tri- 
partite graph. a is said to be a jump for an integer r > 2 if the analogous 
property holds for r-graphs. 

Erdés [24] showed that for all r, every a € (0, a) is a jump and 
conjectured that, like in the case of ordinary graphs, every a € [0,1) isa jump. 
And perhaps the most surprising fact about jumps is that this “jumping 
constant conjecture” is not true. The first examples of non-jumps were given 
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by Frankl and Rédl in [32], and a number of other examples followed. All of 


them, however, live in the interval [ao 1), and what happens in between 
(ie., for a € [4, 24)) was a totally grey area. 


Using flag algebras, Baber and Talbot [12] gave the first example of jumps 
in this intermediate interval by showing that all a € [0.2299,0.2316) are 
jumps for r = 3. Their proof also uses a previous characterization from [32] 
that allows to get rid of the condition that G € F must be arbitrarily large 
by considering their Lagrangians instead. Given this reduction, Baber and 
Talbot produced a set F consisting of five 3-graphs on 6 vertices such that 
A(F’) > 0.2316 (F' € F) while 7(F) < 0.2299. It is worth noting that whether 
a = 2/9 is a jump for r = 3 (which was one of the questions asked in the 
original paper by Erdés) still remains open. 


3.5. Induced H-Densities 


So far we predominantly dealt with “normal” Turan densities 7(F), tmin(F), 
i.e. special cases of tH(F), tmin,z(*) when H is a (hyper)edge. In this 
section, on the contrary, we review a few results proven with the help of flag 
algebras in which the graph H is more complicated. 

Triangle-free graphs need not be bipartite. But how exactly far from being 
bipartite can they be? In 1984, Erdés [25, Questions 1 and 2] considered three 
quantitative refinements of this question, and one of them was to determine 
exc, (n, K3). Gyori [38] had a partial result in that direction. 

The asymptotic version of Erdés’s question was solved using flag algebras 
by H. Hatami et al. [40] and Grzesik [37] who independently proved that 


nes(Ks) = 2. (15) 


The standard trick with blow-ups (cf. (10)) immediately implies that 
exo, (50, K3) = 0°. (16) 


Hatami et al. [40] also proved that the infinite blow-up of Cs is the 
only element ¢ € Hom*(A°,R) realizing equality in (15), and that the 
finite balanced blow-up of Cs is the only graph realizing equality in (16). 
Remarkably, the proof of the latter result bypasses the stability approach 
outlined in Sect.3.2. Namely, given a finite K3-free graph G, instead of 
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viewing this graph as a member of a converging sequence, [40] simply 
considers its infinite blow-up ¢q¢ € Homt(A°,R) and directly applies to it 
the asymptotic uniqueness result. 

When n = 5+ a (1 < a < 4), [40] also proved that exc,(n, K3) = 
0°-*(€ + 1)%, the equality being attained at almost balanced blow-ups of 
Cs. But this proof already uses the traditional stability approach, and, as a 
consequence, works only for sufficiently large n. 

Somewhat similar in spirit is another question asked by Erdés in [23]. The 
Ramsey theorem is equivalent to the statement that for any fixed k,¢ > 0, 
for all sufficiently large n we have expin,r,(n, Ke) > 0. Erdés asked about the 
quantitative behavior of this function and conjectured that the minimum is 
attained for the balanced (¢ — 1)-partite graph. Asymptotically, if we let 


def 
Che = Tmin,t, (Ke), 


Erd6s’s conjecture says that 


Cre = (€- ame (17) 


This was disproved by Nikiforov [56] who observed that the blow-up of Cs we 
just discussed in a different context actually implies that c4.3 < x. Moreover, 
Nikiforov showed that Erdés’s conjecture (17) can be true only for finitely 
many pairs (k, £). 

Using flag algebras, Das et al. [20] and Pikhurko [60] independently 
proved that c3,4 = 1/9 (thus confirming Erdés’s conjecture in this case) 
and that c4,3 = 3/25 (thus showing that Nikiforov’s counterexample is the 
worst possible). Both papers use the stability approach to get exact results 
for sufficiently large n. Das et al. [20, Sect.6] states that their unverified 
calculations confirm Erdés’s conjecture in two more cases: c3,5 = 1/16 and 
3,6 = 1/25. Both these calculations were verified by Vaughan (referred to in 
[60]) who also confirmed Erdés’s conjecture in one more case: c3,7 = 1/36. 
Along the other axis, Pikhurko [60] calculated cs5,3, c¢,3 and c7,3. 

Let us now discuss “pure” inducibility i(H) of a graph/orgraph/hypergraph 
H that in our notation is simply equal to 7y(Q). 

There is one self-complementary graph on 4 vertices, Py and five 
complementary pairs, which (since my(0) = 7q(@)) give rise to six different 
problems of determining 74(@). One of them (K4/JI4) is trivial, and two 
problems had been solved before with other methods. 

Using flag algebras, Hirst [42] solved two more cases: he showed that 


72 
TWKy—Ko (0) = 125 
and that 
3 
TK4—P3 (0) = 6. 


8 
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Thus, now FP, is the only remaining graph on 4 vertices whose inducibility is 
still unknown. 

Sperfeld [72] studied inducibility for oriented graphs. Using flag algebras, 
he showed that 7c, (0) = $ and obtained a few non-exact results improving on 
previous bounds: mp,(0) < 0.4446 (the conjectured value is 2/5), 7c, (0) < 
0.1104 and mK, ,(0) < 0.4644. Then Falgas-Ravry and Vaughan [33] were 
able to actually compute the latter quantity: 


TKi,2 (0) — 2V/3 — 3. 


They also computed 7K, ,(@) that turned out to be a rational function in a 
root of a cubic polynomial. 

In the department of 3-graphs, the same paper [33] calculated the 
inducibility of Go: 


rete 7 


Slightly stretching our notation, let tm.%(0) be the minimal induced density 
of the collection of all 3-graphs on m vertices with exactly k edges (thus e.g. 
Ta, (0) = 74.2(0)). Falgas-Ravry and Vaughan also proved in [33] that 


75.1(0) = 75.9(0) = ; 


and 


75.4(0) = = 


3.6. Miscellaneous Results 


A graph H is common if the sum of the number of its copies (not necessarily 
induced) in a graph G and the number of such copies in the complement of 
G is asymptotically minimized by taking G to be a random graph. Erdés [23] 
conjectured that all complete graphs are common, and this conjecture was 
disproved by Thomason [73] who showed that for p > 4, K, is not common. 
It is now known that common graphs are very rare, and several authors 
specifically asked if the wheel W; shown on Fig. 2 is common. 

This question becomes amenable to the (manifestly induced) frame- 
work of flag algebras by using the transformation (1). And, indeed, H. 
Hatami et al. [41] proved that Ws is common. This is the only result in 
our survey where optimization takes place over a rather complicated linear 
combination of “primary” induced densities rather than individual densities. 
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W; 


5 


Fig. 2) The 5-wheel. 


Erdés et al. asked in [22] a question that later became known as the (2/3)- 
conjecture. Given a 3-coloring of the edges of K,,, what is the smallest ¢ such 
that there exists a color c and a set A of t vertices whose c-neighborhood has 
density at least 2/3? The conjecture says that t = 3, the previously known 
bound was ¢t = 22, and using flag algebras, Kral’ et al. proved in [46] that 
one can actually take t = 4. 

In [47], Kral’, Mach and Sereni looked at the following geometric problem 
resulted from the work by Boros and Fiiredi [5] and Barany [3]. What is the 
minimal constant cq such that for every set P of n points in R@ in general 
position there exists a point of R? contained in at least cq ( a d-simplices 
with vertices at the points of P. As stated, it is not amenable to the approach 
of flag algebras, but Gromov [36] was able to find a topological approach to 
it, and its later expositions (see [47] for details) brought it rather close to that 
realm. One remaining concepts that still can not be handled by flag algebras 
in full generality is that of Seidel minimality as it quantifies over arbitrary 
sets of vertices. Kral’, Mach and Sereni, however, showed that by applying 
this property only to certain sets definable in this language they can improve 
known bounds on cq. 

In the rest of this section we review a few more results about 3-graphs 
obtained with the method of flag algebras that were not addressed in our 
previous account. This activity started with the Mubayi challenge when all 
exact results presented to the author by Dhruv Mubayi found their new flag- 
algebraic proofs in [65, Sect.6.2] of varying and surprisingly unpredictable 
computational difficulty. Razborov [65] also gave a few non-exact results, of 
which we would like to mention here only 7(G3) < 0.2978 later improved by 
Baber and Talbot [12] to 7(G3) < 0.2871, which is already quite close to the 
conjectured value 2/7 (see (14)). 

Baber and Talbot [13] go over “critical” densities 2/9, 4/9, 5/9, 3/4 (recall 
that (P32) = 4/9 and 5/9, 3/4 are conjectured values for 7(K?), 7(K3), 
respectively). For every a from this set they were able to construct one (for 
a = 2/9) or more (for a € {4/9,5/9,3/4}) 3-graphs F with a(F) = a. 
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Falgas—Ravry and Vaughan [34] proved, besides the results we already 
cited above in various contexts, several more exact results: 


12 
1(G3,Cs5, F3.2) = 79° 
5 
G3, F: =— 
( 35 3,2) 18” 
3 
1(J4, F3,2) = 3 


In the second paper [33] of the same authors they prove (again, in addition 
to what we already surveyed before) several more inducibility results: 


16 27 3 
1G@5(K3) = a7” ™Ga(F3.2) = Gy Mg (F3,2) = a5 


9 
TGo (Cs, F3.2) aa 


5 4 
Te TG@2(G3,F3.2)= 9, Ma,(G3,Cs, F3,2) = 5° 


9 ? 

Two forthcoming papers study codegree density m2(F) for 3-graphs 
(see [45, Sect. 13.2] for definitions). Falgas-Ravry, Marchant, Pikhurko and 
Vaughan give a new flag-algebraic proof of the result 72(F3,2) = 2/3 from 
Marchant’s thesis. In the second paper, Falgas—Ravry, Pikhurko and Vaughan 
prove that 72(G3) = 1/4. 


4. Concluding Remarks 


4.1. Flagmatic Software 


In the first few years since the inception of the method, researchers who 
needed it for their work had to write the code on their own, the only thing 
that was available from the shelf were SDP-solvers like CSDP [10] or SDPA. 
It appears as if these homemade pieces of software greatly differ in the level 
of their public availability, user-friendliness and, most important, versatility. 
Like in many other similar scenarios, it can be expected that this period 
of anarchy will eventually be over, and the separation between users and 
developers (with the clear understanding that these two groups are likely 
to overlap significantly) will be defined more clearly. This will also likely 
imply that the many ad hoc programs around will give way to one or a few 
“standard” packages, and researchers that are new to the method will largely 
lose the initiative to program on their own. 

One very serious bid to become such a “golden standard” has been made 
by the Flagmatic software developed by Vaughan and, in fact, many results 
that we surveyed above were obtained using this program. It is publicly 
available from http://www.flagmatic.org, and (from everything I know) it 
is user-friendly. Versatility is also improving: while this project started with 
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3-graphs, the last version 2 also has support for ordinary graphs and oriented 
graphs. Time will tell if Flagmatic gets a serious competitor, but at the 
moment this seems to be the only option for a researcher who needs to use the 
method on a reasonably recurrent basis but does not want to invest time into 
writing his/her own code. 


4.2. Beyond Turan Densities? 


Turan densities for dense graphs is by far not the only area in discrete 
mathematics and beyond where Cauchy—Schwarz and positive semi-definite 
programming are used extensively. Thus, it is natural to wonder if formal 
methods similar to flag algebras can be applied elsewhere. In cases we 
potentially have in mind it is more or less clear how to come up with a 
mathematically beautiful calculus that “works in theory”. But our question 
is more pragmatic: can it be done in such a way that it will actually allow to 
prove new concrete results in the area in question. See [34, Sect. 4.1] for a very 
relevant discussion of the complexity barrier that (as we believe) prevents us 
from getting many more, and possibly very great, results with this method 
even on its home field, asymptotic extremal combinatorics. 

We are aware of two moderate but concrete and successful steps in that 
direction. Baber [1, Chap. 2.5] (some of these results were later independently 
rediscovered by Balogh et al. in [6]) extends the method to Turan densities 
for subgraphs of the hypercube Q,,. The latter is a rather sparse graph, 
so significant modifications are necessary. And Norin and Zwols (personal 
communication) started considering applications of the flag algebra frame- 
work to the study of crossing numbers, particularly of the complete bipartite 
graph K,,,,. They already were able to get a numerical improvement on the 
previously best known bound from [21]. 

One more paper that might be mentioned here is the work by Kral’, 
Mach and Sereni [47] on the Boros—Fiiredi—Baraény problem that we already 
discussed in Sect. 3.6. But their approach is sort of opposite: they “massage” 
the problem they are interested in until it fits nicely the framework of flag 
algebras as originally defined in [63]. 


Acknowledgements I am grateful to Rahil Baber, Victor Falgas-Ravry, Ron 
Graham, Sergey Norin and Christian Reiher for useful remarks. 
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Summary. Let k& be an integer. A graph G is k-arrangeable (concept introduced 
by Chen and Schelp) if the vertices of G can be numbered v1, v2,...,Un in such a 
way that for every integer i with 1 < i < n, at most k vertices among {v1,v2,..., vi} 
have a neighbor v € {vi+1, viz2,---,Un} that is adjacent to v;. We prove that for 
every integer p > 1, if a graph G is not 2500(p + 1)*-arrangeable, then it contains 
a K,-subdivision. By a result of Chen and Schelp this implies that graphs with 
no K,-subdivision have “linearly bounded Ramsey numbers,” and by a result of 
Kierstead and Trotter it implies that such graphs have bounded “game chromatic 
number.” 


The Theorem 


In this paper graphs are finite, may have parallel edges, but may not have 
loops. We begin by defining the concept of admissibility, introduced by 
Kierstead and Trotter [8]. 

Let G be a graph, let M C V(G), and let v € M. A set A C V(G) is 
called an M-blade with center v if either 


(i) A= {a} and a € M is adjacent to v, or 
(ii) A = {a,b}, ae M, b € V(G) — M, and b is adjacent to both v and a. 


An M-fan with center v is a set of pairwise disjoint M-blades with center v. 
Let k be an integer. A graph G is k-admissible if the vertices of G can be 
numbered v1, v2,...,Un in such a way that for every i = 1,2,...,n, G has no 
{v1,v2,...,u;}-fan with center v; of size k +1. 

As pointed out in [8] the concepts of arrangeability and admissibility are 
asymptotically equivalent in the sense that if a graph is k-arrangeable, then 
it is 2k-admissible, and if it is k-admissible, then it is (k? —k+1)-arrangeable. 
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Let p be an integer. A graph G has a K,,-subdivision if G contains p 
distinct vertices v;, v2,...,Up and (8) paths P;; (i,j =1,2,...,p, i<j) such 
that P,;; has ends v; and v;, and if a vertex of G belongs to both Pj and Pj; 
for (i, 7) 4 (#’, 7’), then it is an end of both. The following is our main result. 


Theorem 1. Let p > 1 be an integer. If a graph G is not 50p?(p? + 1)- 
admissible, then it has a K,-subdivision. 


We first prove Theorem 1, and then discuss its applications. For the proof 
we need the following result, originally proved with a larger constant by 
Bollobés and Thomason [3], and independently by Komldés and Szemerédi [9], 
who proved it with a smaller constant, but only for sufficiently large p. Our 
version follows from a result of Thomas and Wollan [11] and can be found 
in [7, Theorem 7.2.1]. 


Theorem 2. Let p> 1 be an integer. If a simple graph on n vertices has at 
least 5p?n edges, then it has a K,-subdivision. 


We first prove a lemma. 


Lemma 1. Let p > 1 be an integer, let G be a graph, and let M be a non- 
empty subset of V(G). If for every v © M there is an M-fan in G with center 
v of size 50p?(p? +1), then G has a Ky-subdivision. 


Proof. Let p,G, and M be as stated in the lemma, and for v € M let Fy bea 
fan in G with center v of size 50p?(p? +1). We may assume that G is minimal 
subject to M C V(G) and the existence of all F,,(v € M). Let |M| =m, let e, 
be the number of edges of G with both ends in M, and let e2 be the number 
of edges of G with one end in M and the other in V(G) — M. Then from the 
existence of the fans F, for v € M we deduce that 2e; + e2 > 50p?(p? + 1)m. 

We claim that if |V(G)— M| > 10p?m— ey, then G has a K,-subdivision. 
Indeed, by our minimality assumption for every w € V(G) — M there exist 
vertices u, v € M such that {u,w} € Fy. For w € V(G) — M let us denote 
by e(w) some such pair of vertices. Let J be the graph obtained from G by 
deleting V(G) — M and for every w € V(G) — M adding an edge between 
the vertices in e(w). Then |E(J)| > 10p?m, and since every pair of vertices 
is joined by at most two (parallel) edges, J has a simple subgraph J’ on 
the same vertex-set with at least 5p?m edges. By Theorem 2 J’ has a K,- 
subdivision L. Every edge of L that does not belong to G joins two vertices 
u, v with {u,v} = e(w) for some w € V(G) — M. By replacing each such 
edge by the edges uw, vw we obtain a K,-subdivision in G. This proves our 
claim, and so we may assume that |V(G) — M| < 10p?m — e1. 

Now |V(G)| < (10p? + 1)m — ej, and 


|E(G)| = ey +eg = 2e1 +€2—€1 > 50p?(p” + 1)m — él 
> 5p? ((10p? + 1)m — e1) > 5p*|V(G)], 


and hence G has a K,-subdivision by Theorem 2, as required. 


Arrangeability and Clique Subdivisions 235 


Proof of Theorem 1. Let p be an integer, and let G be a graph on n vertices 
with no K,-subdivision. We are going to show that G is 50p?(p? + 1)- 
admissible by exhibiting a suitable ordering of V(G). Let i € {0,1,...,n} 
be the least integer such that there exist vertices vj41,Vj42,.--,Un with the 
property that for all 7 = 7,i+1,...,n,G has no (V(G)—{v;41, vj42, +--+, Un})- 
fan with center v; of size 50p?(p? +1). We claim that i = 0. Indeed, otherwise 
by Lemma 1 applied to M = V(G)— {vj41, vita,---,;Un} there exists a vertex 
v; with no M-fan with center v; of size 50p?(p? +1), and so the sequence w, 
Vit+1,-+++;Un contradicts the choice of 7. Hence 7 = 0, and vj, v2,...,Un is the 
desired enumeration of the vertices of G. 


Applications 


We now mention two applications of Theorem 1. Let G be a class of graphs. 
We say that G has linearly bounded Ramsey numbers if there exists a constant 
c such that if G € G has n vertices, then for every graph H on at least cn 
vertices, either H or its complement contain a subgraph isomorphic to G. 
The class of all graphs does not have linearly bounded Ramsey numbers, but 
some classes do. Burr and Erdés [4] conjectured the following. 


Conjecture 1. Let t be an integer, and let G be the class of all graphs 
whose edge-sets can be partitioned into t forests. Then G has linearly bounded 
Ramsey numbers. 


Chvatal, Rodl, Szemerédi and Trotter [6] proved that for every integer 
d, the class of graphs of maximum degree at most d has linearly bounded 
Ramsey numbers, and Chen and Schelp [5] extended that to the class of k- 
arrangeable graphs. Chen and Schelp also showed that every planar graph has 
arrangeability at most 761, a bound that has been subsequently lowered to 10 
by Kierstead and Trotter [8]. From Chen and Schelp’s result and Theorem 1 
we deduce 


Corollary 1. For every integer p > 1, the class of graphs with no Ky- 
subdivision has linearly bounded Ramsey numbers. 


For the second application we need to introduce the following two-person 
game, first considered by Bodlaender [2]. Let G be a graph, and let ¢ be 
an integer, both fixed in advance. The game is played by two players Alice 
and Bob. Alice is trying to color the graph, and Bob is trying to prevent 
that from happening. They alternate turns with Alice having the first move. 
A move consists of selecting a previously uncolored vertex v and assigning 
it a color from {1,2,...,¢} distinct from the colors assigned previously (by 
either player) to neighbors of v. If after |V(G)| moves the graph is (properly) 
colored, Alice wins, otherwise Bob wins. More precisely, Bob wins if after less 
than |V(G)| steps either player cannot make his or her next move. The game 
chromatic number of a graph G is the least integer t such that Alice has a 
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winning strategy in the above game. Kierstead and Trotter [8] have shown 
the following. 


Theorem 3. Let k and t be positive integers. If a k-admissible graph has 
chromatic number t, then its game chromatic number is at most kt + 1. 


They have also shown that planar graphs have admissibility at most 8, and 
hence planar graphs have game chromatic number at most 33 by Theorem 3 
and the Four Color Theorem [1]. From Theorems 1 to 3 we deduce 


Corollary 2. Let p be a positive integer. Then every graph with no Ky- 
subdivision has game chromatic number at most 500p*(p? +1) +1. 
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Dedicated to Paul Erdés 


Summary. We prove that double exponentiation is an upper bound to Ramsey’s 
theorem for colouring of pairs when we want to predetermine the order of the 
differences of successive members of the homogeneous set. 


The following problem was raised by Jouko Vaananen for model theoretic 
reasons (having a natural example of the difference between two kinds of 
quantifiers, actually his question was a specific case), and propagated by Joel 
Spencer: Is there for any n, c an m such that 


(x) For every colouring f of the pairs from {0,1,...,m— 1} by 2 (or even 
c) colours, there is a monochromatic subset {ao,...,@n—1}, @o < a1 <-> 
such that the sequence (a;41 — a; : 7 <n-—1) is with no repetition and is 
with any pregiven order. 


Noga Alon [1] and independently Janos Pach proved that for every n, c there 
is such an m as in (x); Alon used van der Waerden numbers (see [2]) (so 
obtained weak bounds). 

Later Alon improved it to iterated exponential (Alan Stacey and also the 
author have later and independently obtained a similar improvement). We 
get a double exponential bound. The proof continues [4]. Within the “realm” 
of double exponential in c, n we do not try to save. 

We thank Joel Spencer for telling us the problem, and Martin Goldstern 
for help in proofreading. 

Notation. Let ¢, k, m,n, c, d belong to the set N of natural numbers (which 
include zero). A sequence 7 is (7(0),...,7(€gn —1)), also p, v are sequences. 
71 <lv means that 7 is a proper initial subsequence of v. We consider sequences 
as graphs of functions (with domain of the form n = {0,...,n—1}) but 7Nv 
means the largest initial segment common to 7 and v. 7~(s) is the sequence 


(n(0),.--,7(€gn — 1), s) (of length £g(7) +1). 
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Let ’m := {7 : €g(n) = 2, and Rang(n) C {0,...,m—1}}, m= U *m. 
k<e 
For v € ‘*m we write [v]z,, (or [v] if 2 and m are clear from the context) for 
the set {7 € 'm:v <n}. 
[AJ” = {w C A: |w| = n}. Intervals [a,b), (a,b), [a,6) are the usual 


intervals of integers. The proof is similar to [4] but sets are replaced by trees. 


m 


1. Definition 


r =r(n,c) is the first number m such that: 


(x)™° for every f : [{0,...,m-—1}]? > {0,...,c—1} and linear order <* on 
{0,...,n—2} we can find ag <... < @p_1 € [0,m) such that: 

(a) fl{ai,aj}:0<i<j <n} is constant. 

(b) The numbers bg := ae41 — ae (for £ <n —1) are with no repetitions and 
are ordered by <*, i.e. 1 <* 7 = bj < b;. 
We will find a double exponential bound for r = r(n,c), specifically, 

r=r(n,c) < 20(+)")"* (go our bound is double exponential in n and in c). 
This is done in Sect. 6. Alon conjectures that the true order of magnitude 


of r(n,c) is single exponential, and Alon and Spencer have proved this for 
the case where the sequence (a;+1 — aj : i << _m—1) is monotone. 


2. Definition 


We say S is an (€,m*,m,w)-tree if: 


(a) uC {0,1,...,@-—1} and m* >m 

(b) $C (m") 

(c) S is closed under initial segments 

(d) If v € S is <-maximal then @g(v) = @ 

(e) Ifu € S and ég(v) € u then for m numbers j < m* we have v~(j) € S 


3. Claim 


Suppose k, ’,m,p,m* € N satisfy 


Viimeows m* > om, 


then for every i(*) < , and u C [0,@—1), and |u| < p and f : [£(m*)]? = 
{0,1} there is T C’* (m*) closed under initial segments, (T,<) = (“m,<) 
satisfying 
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@ for every m,...,m~ € T° (m*) with (m | i(*),...,m% | i(*)) pairwise 
distinct we can find a set S such that: 


(a) S is an (€,m*,m, u i(*))-tree 
(b) If j € [1,k) and v € SM (m*) then f({nj,ne}) = F(Km.7}) 
(c) vE SM! (m*) > ne File) sv 


Remark 1. (1) With minor change we can demand in @Q “for any 
i(x) << L”. 
(2) We could use here f with range {0,...,c—1}, and in claim 8 get a longer 
sequence (ve : £ < n*) such that f({ve,,ve,}) depends just on ve, N ve, 
then use a partition theorem on such colouring. 


Proof. For each n €'* (m*) choose randomly a set Ay C [0,m*),|A,| = 
m,A, = {a],...,2"} (pairwise distinct, chosen by order) (not all are 
relevant, some can be fixed). 

We define T = {n: 7 € (m*), and i < lg(n) > n(i) € Anji}- 

We have a natural isomorphism h from ‘2m onto T: 


=ne A no=2%, 
i<lgv 
Our problem is to verify Q, we prove that the probability that it fails is < 1, 
this suffices. 
We can represent it as: 


(+), if 4,...,y% € “m and m | i(«),...,v, | i(*) distinct, then for 


h(v1),...,h(v~) there is S as required there. 


So it suffices to prove ae for any given such i(*) < @ and 14,...,v% the 
probability of failure is < —+~ = ) as it suffices the demand in  ( ), to 


a 


hold for the minimal suitable i(«). Without loss of generality we may assume 
u = u)\i(*). For this we can assume 2 are fixed whenever —[h(v;,) | i(*) <p] 
or lg(p) ¢u. 

Let Y = {n € *(m*) : Prob[h(v,) = n] 4 0}, so |Y| = ml, 

So A(v),...,h(ve_-1) are determined. Now h(v1),...,h(ve_-1) and f 
induces an equivalence relation E on Y: 


k-1 
nf En" iff f\ f({h(v;),0'}) = FLY), 09). 
jel 
The number of classes is < 2*~1, let them be A1,..., Aoz—1 (they are pairwise 


disjoint, some may be empty). 

We call A; large if there is S as required in clauses (a) and (c) of & such 
that (Vp)[o € SM *(m*) > pé€ Aj]. 

It is enough to show that the probability of h(v,) belonging to a non-large 
equivalence class is < —, hence it is enough to prove: 


(". ) 
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(*)2 Aj; not large > Prob(h(v,) € Aj) < ta 
bk )X 
So assume Aj; is not large. Let Y* := {n |i: 1 €Y andi < #}. 
n € Y*: there is S C Y* satisfying 
a (a)’ Sis an (€,m*,m,u \ £g(n)) — tree 
eS 


(b)' ve Sn*(m*)>n<4v 
(c)' for every v € SN *(m*) we have v € Aj 


Let Z* := {n € Z;: there is no 7! <n, 7 € Z;}. 
Clearly h(v% [ i(*)) € Z; (as Aj is not large) hence h(vj, | i(*)) ¢ Z7. 
Clearly 


(x)3 for 7 € Y*\ Z; such that &g(n) € u we have 
|{i : nt) € ZF (or even € Z;)}| < m. 


But ifv <n, v € Z; then n ¢ Z;. Hence 
(*)4 for 7 € Y* such that ¢g(7) € u we have 


{a : nt) € ZFS] <m. 
Now 


(«)5 if 7 © A; (hence 7 € Y; remember that A; is not large) then, 7 € Z;, 


hence V n [ (§ +1) € Z}. 
jeu 


So 


Prob(h(v,) € Aj) < Prob( VV (hv, FG + 1))€ z;) 


jeu 


< $7 Prob(h(ve | (7 +1) € Z7)) 


jeu 
m 
< |u| x iP 


(first inequality by (*)5, second inequality trivial, last inequality by (*)4 
above). So it suffices to show: 


equivalently 


U 
m? > Ju) xm x ("1 ) x24 


as (™") < m*/k!, and |u| < p by the hypothesis (*)%,2,m,p,m* we finish. 
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4. Lemma 


Assume 


@) P1y---,Pn €™ + 2 are distinct, for €€ {2,...,n}, we have rp € {1,...,£— 
1} such that lg(peNp,,) = @—-landr € {1,...,@-1}\{re} > la(peNpr) < 
g-1 

b) f : [’m)? > [0,c) = {0,...,c-} 

c) m= g(nt1)ern 


Then we can find 7,..., 1%, € ’m such that: 


a) fl [{m,---,;m}} is a constant function 
(B) (€9(nit1 A Mig.) 1 2 = 1,...,2—1}) is a sequence with no repetitions 
ordered just like (€9(pi41 pr,4,) 14 = 1,n — 1}); also: 


nita(lg(ni+1 a Nias) < Triad (€g(ni a Nrigs)) 
= piti(lg(pit1 O Prigi)) < Prigs (C9(0g(Pit1 O Pris, ))- 
Remark 1. (1) Note that iff C"~? 2, |[| =n and the set {pi1Np2: pi F pe 
are from T} has no two distinct members with the same length then we can 
list T as (p1,..., Pn) as required in clause (a) of Lemma 4. 
(2) So if <* is a linear order on {1,...,n —1} then we can find distinct 


Pi;-++;Pn €™ + 2 as in clause (a) of Lemma 4 and a permutation o of 
{1,...,n} such that: fori #7 € {1,...,n—2} we have 


i<" jf iff Co(poriy M Po(i41)) > £9(Po(j) M Po(y+41))- 


(E.g. use induction on n.) 


Proof. Let us define (m,; : 2 <j < en) by induction on j: 


m2=n, M41 = Cs 
Check that m; < g(r so in particular men) < m. Now we claim that 
for any number d € [1,c] the following holds: 


®, Assume g € [0,n°) such that q divisible by n? and q+n4 < en, u = [q,q+ 
n*), T* is an (€,m,mq+an),u)-tree and f is a function from [Z* M ‘m|]? 
with range of cardinality d. Then we can find m,...,1%m € T* (£m) such 
that clauses (a) and (8) of the conclusion of Lemma 4 hold. 


This suffices: use g = 0, d= c. We prove this by induction on d. If d= 1, 
trivial as only one colour occurs. For d+ 1 > 1, without loss of generality 
Rang(,) = [0, d], let f” : [£m]? — {0,1} be f"({n'sn'"}) = Minf f ({nl.77")}, L- 
Let for j < n, uj := [q+n*j, q+n4j+n*). By downward induction on j € [1,7] 
we try to define T; such that: 


(i) Tj is an (€,m,mq+na+j), U ua)-tree 
tJ 
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(ii) Tj S Tj41 

(iii) For every 7 € [1,n — 1] and m,...,n; € Tj; Am with (m | (q+ 
n¢j),...,7; | (q+n%j)) pairwise distinct, we can find n! A nr € Tj410°m 
such that: 


nj | (q+n%j) <7,7" 
lg(n' Nn") € uy 
f'({1',n"}) =0 


\ (0=f' Une n}) +0 = f'n} =f nen"))- 


te (1,3) 
This suffices as then we can choose by induction on j = 1,...,m a sequence 
v; € T; 1 ('m) such that (after reordering) the set {v1,...,Un} will serve 


as {m,---, 7m} of ®, (with the constant colour being zero). Let us do it in 
detail. 


By induction on j = 1,..., we choose vi, a sty such that: 
(a) vi,... vf are distinct members of T; M (“m) 
(b) ft [{vj,---,0j}] is constantly zero 


(c) For € = {2,...,j} we have fg(v} 0 vi,) € Ure — 1 
For j = 1 no problem. In the induction step, i.e. for 7 + 1, we apply the 
condition (iii) above with (vz : € € [1,3], # qj+1)~(vj,,,) here standing 
for m,...,7; there (we want Ohi be the last), the condition (ne | (q + 
nj) :£=1,...,j) with no repetition follows by clause (c), so we get 1’, 
n! € Tj410('m) as there. W.l.0.g. 7/(€9(n' Nn") <n" (Lg(n! 7"). 

We now define wt for £€=1,...,7 +1: 

Ife {1,...,f +1} \ (G41, 7541} then vj? = v3 (remember u CT): 

If 541 (€9(P541 N Pr341)) < Prj41(€9(P541 N Prj41)) then UT = 7’ and 


a 


If Prj4a (€9(pj41 a Biseqs)) < pj+i(lg(pj41 a Pisin) then ae = ne and 
pith ll. 


us . 
Now check. (Note Tp, uo could be omitted above.) 


Carrying the Inductive Definition. For 7 = n, trivial : let T, = T* 


(given in ®),). 
For j € [2,n), where Tj41,...,Zp are already defined, we apply Claim 2 
with 
Mat+nd+j+1s, Mq+nd+js nj + 1), ds q+ nj, lq + nj, q+ nj oF n*), nj 
here standing for 
m*, ™m, £; k, i(*), U, Pp 


there. (Ie. the tree in Claim 2 is replaced by one isomorphic to it, levels 
outside U;_; ui can be ignored.) 


A Finite Partition Theorem with Double Exponential Bound 243 


So we need to check mg+na+j+1 > n4j(Mesnaeg)” GDI, which holds 
by the definition of the m;’s (as d < c—1). But we require above more than in 
Claim 2 (preferring the colour 0). But if it fails for T; then for some 7,...,7; 
in T; we have S' as in © of Claim 2, with no 7! 4 7? € SM ‘m such that 
f'({7',7"}) = 0. On S' we can apply our induction hypothesis on d—allowed 
as the original f misses a colour (the colour zero) when restricted to S. O3 


5. Fact 


Let (nj : i < (2m —1)*) enumerate ‘(2m — 1) in lexicographic order. Let 
A = {0,2,...,2m— 2} C [0,2m — 1),so |A| =m, 
B:= {i < (2m—1)°: m € A}. 
Let 
lowm,e(k) = (2m — 1)*-*-1, and 
highm,e(k) = (2m — 1)*-* - 1. 


(0) Ift #7 are in B, |n; 1 n;| = k then: 

i < J iff mi(k) < ny). 

(1) Ift <j are in B, |n,1n,| = k, then: 

lowm,e(k) < j —1 < highm,e(k) 


(2) If ti: < fi, t2 < jo are all in B, and |n;, 97;,| ¥ |ni2 A752, then: 
A- uy <jo— tg iff Hie FY | > Nig Ojo |- 
Proof. (0) Check. 


1) Let v := An; and k := |v|. We are looking for upper and lower bounds 
5 
of the cardinality of the set (the order is lexicographic) 
C= {ne *(Q2m—-1):m% <9 < 75}. 


Clearly each 7 € C satisfies v <n. Moreover, since 7; € ‘m, each element 
n € C must satisfy n(k) < nj(k) < 2m— 1. Hence 


Co LU b()eem-—1y \{n5} 
s<2Q2m—-1 
so we get |C| < (2m —1)- (2m —1)*-k-! —1 = (Qm—1)* - 1. 
For k = €—1 the lower bound claimed in (1) is trivial, so assume k < £—2. 


Let 1’ := (m | &)~ (mi(k)+1) (note: as 7; (k) < 7;(k) are both in A, necessarily 
u'(k) < nj(k)). Then we have 
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GS U [v'~(s)](¢(am—1)) U {ni}, 


s<2m—1 
so |C| > (2m —1)*-*-1 +1. 
Proof of (2): Check that high e(k +1) < lowm,¢(k), and use (1). 4 


Remark. Also lowm,e(k) = (2m —1)*-*-1! + 1 is O.K. but with the present 
bound we can use only 7; with n(€—1) <_m, B= {i: n; | (@—1) €f A}. So 
(2m — 1)* can be replaced by m(2m — 1)*?. 


6. Conclusion 


If f : [(0,(2m —1)*)F] > [0,c), 2 := ne, m := soe then we can 
find ag < ... < Gn—-1 < m such that f | [{ai,...,a@n—1}]? constant and 
(aj41 — a; i <m-—1) in any pregiven order 


Proof. As in Fact 5, let (m : i < (2m — 1)*) enumerate ‘(2m — 1) in 
lexicographic order, and let B := {i < (2m —1)°: n; € “m} and fori € B 
define ni by ni(j) = 2- (7) for 7 < €. So nj is in the set B from Fact 5. 
Define a function f’ : [’m]? > c by requiring fini 53) = £4, 7}) for all 7, 
j € B. Now the conclusion follows from Lemma 4., Remark 1(2) and Fact 5., 
particularly clause (2). 5 
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Summary. Paul Erdés is 80” and the mathematical community is celebrating him 
in various ways. Jarik NeSsetiil also organized a small conference in Prague in his 
honour, where we, combinatorists and number theorists attempted to describe in 
a limited time the enourmous influence Paul Erdés made on the mathematics of 
our surrounding (including our mathematics as well). Based on my lecture given 
there, I shall survey those parts of Extremal Graph Theory that are connected most 
directly with Paul Erdés’s work. 

In Turdn type extremal problems we usually have some sample graphs 1j,..., Lr, 
and consider a graph G,, on n vertices not containing any L;. We ask for the 
maximum number of edges such a G, can have. We may ask similar questions for 
hypergraphs, multigraphs and digraphs. 

We may also ask, how many copies of forbidden subgraphs L; must a graph 
Gn contain with a given number of edges superseding the maximum in the 
corresponding extremal graph problems. These are the problems on Supersaturated 
Graphs. 

We can mix these questions with Ramsey type problems, (Ramsey-Turdn 
Theory). This topic is the subject of a survey by Simonovits and V. T. Sés (Discrete 
Math 229:293-340, 2001). 

These topics are definitely among the favourite areas in Paul Erddés’s graph 
theory. 


Keywords Graphs, Extremal graphs, Graph theory 


1. Introduction 


Extremal graph theory is a wide and fast developing area of graph theory. 
Having many ramifications, this area can be defined in a broader and in 
a more restricted sense. In this survey we shall restrict our considerations 
primarily to “Turan Type Extremal Graph Problems” and some closely 
related areas. 

Extremal graph theory is one of the wider theories of graph theory and 
— in some sense — one of those where Paul Erdés’s profound influence can 
really be seen and appreciated. 


Remark [N] 2. Recently, several new schools emerged in this topic, and 


also, mostly in the last 10-15 years, a completely new approach emerged, 
which, 


(a) On the one hand, connects discrete methods with continuous ones, 

(b) On the other hand, generalizes the “classical notion” of Extremal Graph 
Problems and often tries to use a “Calculus”; further the new schools 
occasionally use computers in solving extremal problems. 


Here the reader is refered to Lovasz’ book [193] or to [295], or to Lovasz’ 
homepage, and also to Razborov’s works [311], or [271, 272]. 


? This refers to the time of the conference, not to when this volume appears. 
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Actually, there are two distinct theories here: the theory of dense and the 
theory of sparse graphs (see e.g. [193, 248, 249]) but we skip the sparse case. 


What Is a Turan Type Extremal Problem? 


We shall call the Theory of Turan type extremal problems the area which — 
though being much wider — still is originated from problems of the following 
type: 

Given a family £ of sample graphs, what is the maximum number of edges 

a graph G,, can have without containing subgraphs from CL. 


Here “subgraph” means “not necessarily induced”. In Sect. 11 we shall 
also deal with the case of “excluded induced subgraphs”, as described by 
Promel and Steger. 

Below K,, C;, and P; will denote the complete graph, the cycle and the 
path of t vertices and e(G) will be the number of edges of a graph G. G, 
will be a graph of n vertices, G(X,Y) a bipartite graph with colour classes 
X andY. 

The first result in our field may be that of Mantel [132] back in 1907, 
asserting that if a graph G,, contains no K3, then 


xa [*). 


Mantel’s result soon became forgotten. The next extremal problem was the 
problem of C4 (Erdés [46]). 


The C,-Theorem and Number Theory 


In 1938 Erdés published a paper [46] 


P. Erdés: On sequences of integers no one of which divides the product of 
two others and related problems, Mitt. Forsch. Institut Mat. und Mech. 
Tomsk 2 (1938) 74-82. 


In this paper Erdés investigated two problems: 


(A) Assume that ni <--- < nx are positive integers such that n; does not 
divide n,n, except if either i = h or i = €. What is the maximum number 
of such integers in [2,n]? Denote this maximum by A(n). 


Let a(n) denote the number of primes in [2,n]. Clearly, the primes 
in [2,n] satisfy our condition, therefore A(n) > a(n). One could think 
that one can find much larger sets of numbers satisfying this condition. 
Surprisingly enough, the contrary is true: Erdés has proved that A(n) » 7(n). 
More precisely, 
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n2/3 n2/3 
n(n) + ——,— < A(n) < a(n) +O (=) , 
80 log* n log” n 


For us the other problem of [46] is more important: 


(B) Assume that n; < +--+ < nx are positive integers such that nin; A np ne 

unless {i,j} = {h,¢}. What is the maximum number of such integers in 

[1, n]? Denote this maximum by B(n). 

Again, the primes of [2,] satify this condition. Here Erdés proved that 
cn3/4 


(log n)3 
Later Erdés improved the upper bound to 


n3/4 
B(n) < m(n) + O | ——— ], 
in) in) (an) 


a(n) + 


< B(n) < r(n) +O (ni/*) ; 


see [58]. It is still open if, for some c ¥ 0, 


en3/4 
B(n) = a(n) + (1+ (1) Gears 


or not. 

Solving this unusual type of number theoretical problem, Erdés (probably 
first) applied Graph Theory to Number Theory. He did the following: Let D 
be the set of integers in [1,n?/%], P be the set of primes in (n?/3,n] and 
B=DUP. 


Lemma (Erdés). Every integer a € [1,n] can be written as 
a = bd, where bE B, de D. 


Let A be a set satisfying the condition in (B). Let us represent each a € A 
as described in the Lemma: a; = 6j(;)d;(7). We may assume that bj(3) > dry). 
Build a bipartite graph G(B,D) by joining b; to d;, if a = b;d, € A. Thus we 
represent each a € A by an edge of a bipartite graph G(B, D). Erdés observed 
that 


the number theoretic condition in (B) implies that Cy Z G(B,D). 
Indeed, if we had a 4—cycle (bidibed2) in G(B,D), then a1 = bid1, ag = 
d1b2, a3 = bed2 and a4 = db, all would belong to A and a,a3 = azga4 would 


hold, contradicting our assumption. So the graph problem Erdés formulated 
was the following: 


Given a bipartite graph G(X, Y) with m and n vertices in its colour classes. 
What is the maximum number of edges such a graph can have without 
containing a C4? 


Erdos proved the following theorem: 


Theorem 1.1. If Cy, Z G(X, Y), |X| =|Y| =k, then 


Paul Erdos’ Influence on Extremal Graph Theory 249 


e(G(X,Y)) < 3k°/?. 


Here the constant 3 is not sharp (see Sect.4). Basically this theorem 
implied the upper bound on B(n). To get the lower bound Erdés used finite 
geometries. Erdds writes: 

... Now we prove that the error term cannot be better than O (or) : 

First I prove the following lemma communicated to me by Miss E. Klein.* 


Lemma (Eszter Klein). Given p(p +1) +1 elements, (for some prime 
p) we can construct p(p +1) +1 combinations, taken (p+ 1) at a time* 
having no two elements in common. 


Clearly, this is a finite geometry, and this seems to be the first application 
of Finite Geometric Constructions in proving lower bounds in Extremal 
Graph Theory. Yet, Erdés does not speak here of finite geometries, neither 
of lower bounds for the maximum of e(G(X,Y)) in Theorem 1.1. 

In the last years” Erdés, Andras Sarkézy and V. T. Sos started applying 
similar methods in similar number theoretic problems, which, again, led to 
new extremal graph problems, [83]. I mention just one of them: 

(C) Let Fy (N) be the maximum number of integers a, < az <-:: < ay 
in {[1, N] with the property that the product of & different ones is never a 
square. 


Theorem 1.2 (Erdés—A. Sdrk6ézy—T. Sés [83]). There exist a positive 
absolute constant c > 0 and for every ¢ > 0 an No(e) such that for N > No(e) 
we have 


(/2 — e)N2/3 


- _ 7/9 
og’ N < Fe(N) — 2(N) — a(N/2) < cN'* log N. (1) 


Taking all the primes of [2, N] and all the numbers 2p where p is a prime 
in [1, N/2] we get 7(N)+7(N/2) such numbers (satisfying (C)) and the above 
theorem suggests that this construction is almost the best. 

The solution of this problem depends on extremal graph theorems 
connected to excluding Cg. Theorems analogous to Theorem 1.2 hold for 
the even values of k, and somewhat different ones for odd values of k. The 
question which was asked is: 


3 Eszter Klein, later Mrs. Szekeres. 


4 Meaning: p + 1-tuples.... The text here does not tell us if the role of Mrs. 
Szekeres was important here or not, but somewhere else Erddés writes: “Mrs. 
Szekeres and I proved ...” 


° Watch out: I wrote this many years ago! 
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What is the maximum number of edges a bipartite graph G(U,V) with 
u RED and v BLUE vertices can have if G(U,V) contains no Cg and 
uv < N? 


In [83] the following conjecture was formulated: 


Conjecture 1.3. If G(U,V) is a bipartite graph with u = |U| RED vertices 
and v = |V| BLUE ones, and G(U, V) contains no C¢, and v < u < v?, then 
e(G(U,V)) < c(uv)?/. 


De Caen and Székely [34] and Faudree and Simonovits [253] had earlier 
some related estimates. The upper bound (1) (of [83]) has been improved 
first by Gabor Sadrkézy [148]. Then E. Gyéri [268, 269] proved the above 
conjecture, which in turn brought down the upper bound of (1) to the lower 
bound, apart from some log-powers. 


Theorem 1.4 (Gy6ri). If Co, Z G(m,n), then e(G(m,n)) < (k —1)n + 
O(m?). 

Conjecture 1.5 (GyGri). There exists a c > 0 for which, if Co, Z G(m,n), 
then e(G(m,n)) < (k — 1)n+ O(m?~°). 


Perhaps even e(G(m,n)) < (k — 1)n + O(m3/?) could be proved for Ce. 


Further sources to read: Works of Lazebnik, Ustimenko, and Woldar 
should be mentioned here, e.g., [124, 125]. See also Fiiredi [101] 

There is also a similar, related but slightly different branch of combinato- 
rial number theory, where the extremal numbers of C2, play important role 
see e.g., Dietmann, Elsholz, Gyarmati and Simonovits [246]. 


“How Did Crookes Miss to Invent the X-Ray?” 


Erdos feels that he “should have invented” Extremal Graph Theory, back in 
1938. He has failed to notice that his theorem was the root of an important 
and beautiful theory. Two to three years later Turan proved his famous 
theorem and right after that he posed a few relevant questions, thus initiating 
a whole new branch of graph theory. Erdés often explains his “blunder” by 
telling the following story. 


Crookes observed that leaving a photosensitive film near the cathod-ray- 
tube causes damage to the film: it becomes exposed. He concluded that 
“Nobody should leave films near the cathod-ray-tube.” Rontgen observed 
the same phenomenon a few years later and concluded that this can be 
used for filming the inside of various objects. His conclusion changed the 
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whole Physics.° “It is not enough to be in the right place at the right time. 
You should also have an open mind at the right time,” Paul concludes his 
story. 


Erdés’s influence on the field is so thorough that we do not even attempt 
to describe it in its full depth and width. We shall neither try to give a very 
balanced description of the whole, extremely wide area. Instead, we pick a 
few topics to illustrate Erdés’s role in developing this subject, and his vast 
influence on others. 

Also, I shall concentrate more on the new results, since the book of 
Bollobas [12] (see also [183]), or the surveys of myself, [156, 158, 159], or 
the surveys of Fiiredi [102] and Sidorenko [151] provide a lot of information 
on the topic and some problem-papers of Erdés, e.g., [54, 62, 64, 65] are also 
highly recommended for the reader wishing to learn about the topic. Also, 
wherever it was possible, I selected newer results, or older but less known 
theorems (partly to avoid unnecessary repetition compared to the earlier 
surveys). 


I had to leave out quite a few very interesting topics. Practically, 


(a) I skipped all the hypergraph theorems, [102, 151] 

(b) the covering problems connected to the Erdés—Goodman-Pésa theorem 
[251], 

(c) applications of finite geometrical methods in extremal graph theory, see, 
e.g. [163, 158], ... application of Lazebnik—Ustimenko type construc- 
tions, [124] and many more... Actully, I shall return to some of them in 
[188]. 

(d) Among others, I had to leave out that part of Ramsey Theory, which is 
extremely near to Extremal Graph Theory, (see [97]) ...and for many 
other things see Bollobds [12, 15, 16] ... 


“The Complete List of Theorems” 


If one watches Erdos in work, beside of his great proving power and elegance, 
one surprising feature is, how he poses his conjectures. This itself would 
deserve a separate note. Sometimes one does not immediately understand 
the importance of his questions. Slightly mockingly, once his friend, Andras 
Hajnal told to him: “You would like to have a Complete List of Theorems”. 
I think there is some truth in this remark, still one modification should be 
made. 


®° When I asked Paul, why did he think that Réntgen’s discovery changed the 
whole Physics, he answered that Rontgen’s findings had led to certain results of 
Curie and from that point it was only a short step to the A-bomb. 
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Erd6s does not like to state his conjectures immediately in their most 
general forms. Instead, he picks very special cases and attacks first these 
ones. Mostly he picks his examples “very fortunately”. Therefore, having 
solved these special cases he very often discovers whole new areas, and it is 
difficult for the surrounding to understand how can he be so “fortunate”. So, 
the reader of Erdés and the reader of this survey should keep in mind that 
Erdés’s method is always to attack important special cases. 


Notation 


We shall primarily consider simple graphs: graphs without loops and multiple 
edges. Later there will be paragraphs where we shall also consider digraph- 
and hypergraph extremal problems. 

Given a family £ of — so called — excluded or forbidden subgraphs, ex(n, £) 
will denote the maximum number of edges a graph G, can have without 
containing forbidden subgraphs. (Containment does not assume “induced 
subgraph” of the given type.) The family of graphs attaining the maximum 
will be denoted by EX(n,£). If £ consists of a single L, we shall use the 
notation ex(n, ZL) and EX(n, L) instead of ex(n, {L}) and EX(n, {L}). 

For a set Q, |Q| will denote its cardinality.’ Given a graph G, e(G) will 
denote the number of its edges, v(G) the number of its vertices, y(G) and 
a(G) its chromatic and independence numbers, respectively. For graphs the 
(first) subscript will mostly denote the number of vertices: Gn, Sn, Tnp, 
...denote graphs on n vertices. There will be one exception: speaking of 
excluded graphs L,...,, we use superscripts just to enumerate these 
graphs. Given two disjoint vertex sets, X and Y, in a graph Gy, e(X,Y) 
denotes the number of edges joining X and Y. Given a graph G and a set X 
of vertices of G, the number of edges in a subgraph spanned by a set X of 
vertices will be denoted by e(X), the subgraph of G spanned by X is G(X). 


Special graphs. kK, will denote the complete graph on p vertices, Th» 
is the so called Turan graph on n vertices and p classes: n vertices are 
partitioned into p classes as uniformly as possible and two vertices are joined 
iff they belong to different classes. This graph is the (unique) p-chromatic 
graph on n vertices with the maximum number of edges among such graphs. 
K,(n1,-..,2p) (often abbreviated to K(n1,...,n,)) denotes the complete 
p-partite graph with n,; vertices in its ith class, i = 1,2,...,p. 

We shall say that X is completely joined to Y if every vertex of X is 
joined to every vertex of Y. Given two vertex-disjoint graphs, G and H, their 
product G® H is the graph obtained by joining each vertex of G to each one 
of H. 


” Our notation is mostly standard. Below some of the notations are just the 
repetitions of what we wrote rarlier. 
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Quoting. Below sometimes I quote some paragraphs from other papers, but 
the references and occasionally the notations too are changed to comply with 
mines. 


2. Turdn’s Theorem 


Perhaps Turan was the third to arrive at this field. In 1940 he proved the 
following theorem, [174] (see also [175, 173]): 


Theorem 2.1 (Turan). (a) If G, contains no K,, then e(G,,) < e(Tn,p-1). 
In case of equality Gn = Th p-1. 


Turan’s original paper contains much more than just this theorem. Still, 
the main impact coming from Turan was that he asked the general question: 


What happens if we replace Kp with some other forbidden graphs, e.g., 
with the graphs coming from the Platonic polyhedra, or with a path of 
length £, etc. 


Turan’s theorem also could have sunk into oblivion. However, this time 
Erd6s was more open-minded. He started proving theorems, talked to people 
about this topic and people started realizing the importance of the field. 

Turan died in 1976. The first issue of Journal of Graph Theory came out 
around that time. Both Paul [63] and I were asked to write about Turdn’s 
graph theory [156]. (In the introductory issue of the journal Turdn himself 
wrote a Note of Welcome, also mentioning some historical facts about his 
getting involved in graph theory [177].) Let me quote here some parts of 
Paul Erdés’s paper [63]. 

In this short note I will restrict myself to Turan’s work in graph theory, 

even though his main work was in analytic number theory and various 

other branches of real and complex analysis. Turan had the remarkable 


ability to write perhaps only one paper in various fields distant from his 
own; later others would pursue his idea and new subjects would be born. 


In this way Turan initiated the field of extremal graph theory. He started 
this subject in 1941,° (see [174, 175]) He posed and completely solved the 
following problem ... 


Here Erdos describes Turan Theorem and Turan’s hypergraph conjecture, 
and a result of his own to which we shall return later. Then he continues: 


® To be precise, Turén proved his theorem in 1940, in a labour camp, and 
published it in 1941. 
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Turan also formulated several other problems on graphs, some of which 
were solved by Gallai and myself [69]. J began a systematic study of 
extremal graph theory in 1958 on the boat from Athens to Haifa and have 
worked on it since then. The subject has a very large literature; Bollobas 
has written a comprehensive book on extremal problems in graph theory 
which will appear soon. (Paul meant [12].) 


One final remark should be made here. As I stated in other places, Paul 
Turan’s role was crucial in the development of Extremal Graph Theory. Still, 
even here there is a point, where Erdés’s influence should be mentioned 
again. More precisely, the influence of an Erdés—Szekeres paper [94]. As 
today it is already well known, Erdés and Szekeres tried to solve a problem 
in convex geometry [94] and rediscovered Ramsey’s Theorem [145]. They 
informed Turan about their theorem, according to which either the graph or 
the complementary graph contains a large complete graph. Turdn regarded 
this result as a theorem where one ensures the existence of a large complete 
subgraph in G,, by assuming something about the complementary graph. So 
Turan wanted to change the condition and still arrive at the same conclusion. 
This is why he supposed that a lower bound is given on the number of edges 
and deduced the existence of a large complete subgraph of G,,. Turan writes 
in [174]: 

Theorem I gives a condition to guarantee the existence of a complete 

subgraph on k vertices in a graph on a finite number of vertices. The 

only related theorem — as far as I know — can be found in a joint paper of 

Pal Erdés and Gyorgy Szekeres [94] and essentially states that if a graph 

A on n vertices is such that its complement A contains only complete 

subgraphs having “few” vertices, then the graph A contains a complete 

subgraph on “many” vertices. Their theorem contains only bounds in the 


place of the expressions “few” and “many”, in fact it gives almost only the 
existence; the exact solution seems to be very interesting but difficult ... 


Some (Further) Historical Remarks 


(a) Turdn’s paper contains an infinite Ramsey theorem. I quote: 


Theorem II. Let us suppose that for the infinite graph A containing 
countably many vertices P,, P2,... there is an integer d > 1 such that 
if we choose arbitrary d different vertices of A, there will be at least two 
among these vertices joined by an edge in A. Then A has at least one 
complete subgraph of infinitely many vertices. 


This theorem is weaker than the one we usually teach in our 
Combinatorics courses, nevertheless, historically it is interesting to see 
it in Turan’s paper. 
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(b) Turdn’s theorem is connected to the Second World War in two ways. 
On the one hand, Turan, sent to forced labour service and deprived of 
paper and pencil, started working on problems that were possible to 
follow without writing them down. Also he made his famous hypergraph 
conjecture, thinking that would he have paper and pencil, he could have 
easily proved it. 

On the other hand, it is worth mentioning that Turdn’s Theorem was 
later rediscovered by A. A. Zykov [182, 1949] who (because of the war) 
learned too late that it had already been published. 

(c) As to Mantel’s result, I quote the last 4 lines of Turan [174]: 

Added in proof. ... Further on, I learned from the kind communi- 
cation of Mr. Jozsef Krausz that the value of d,(n) (= ex(n, K3)) is 
given on p. 438, for k = 3 was found already in 1907 by W. Mantel, 


(Wiskundige Opgaven, vol. 10, p. 60-61). I know his paper only from 
the reference of Fortschritte d. Math., vol 38 p. 270. 


(d) During the war Turdn was trying to prove that either a graph G,, or its 
complementary graph contains a complete graph of order [,/n]. He writes 
in [177]: 
I still have the copybook in which I wrote down various approaches 
by induction, all they started promisingly, but broke down at various 
points. I had no other support for the truth of this conjecture, than 
the symmetry and some dim feeling of beauty: ...In one of my first 
letters to Erddés after the war I wrote of this conjecture to him. In 
his answer he proved that my conjecture was utterly false ... 


Of course, all we know today, what Erd6és wrote to Turan: the truth 
is around c log n. This was perhaps the first application of probability 
to Graph Theory, though many would deny that Erdés’s elegant answer 
uses more than crude counting. Probably this is where the Theory of 
Random Graphs started. (To be quite precise, one should mention, that 
T. Szele had a similar proof for Rédei’s theorem on directed Hamiltonian 
cycles in tournaments, [167], already in 1943, however, Erdés’s proof was 
perhaps of more impact and it was the first where no other approach 
could replace the counting argument. Another early breakthrough of the 
Random Graph Method was when Erdés easily answered the following 
problem of Schiitte [52]: Is there a tournament where for every k players 
there is a player which beats all of them?) 

I would suggest the reader to read also the beautiful paper of Turan 
[177], providing a lot of information on what I have described above 
shortly. 

(e) For a longer account on the birth of the Erdés—Szekeres version of Ramsey 
theorem see the account of Gy. Szekeres in the introduction of the Art 
of Counting, [60]. For geometric aspects see also Pach—Agarwal [139]. 
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3. Erd6s—Stone Theorem 


Setting out from a problem in topology, Erdés and A. H. Stone proved the 
following theorem in 1946 [93]: 


Theorem 3.1 (Erdés-Stone). For every fixed p and m 
1 n 2 
ex(n, Kpaa(m,...,m)) = (1~=) (2) +0(n?), (2) 


Moreover, if p is fixed and m := ,/€,(n) where £,(x) denotes the p times 
iterated logarithm of x, (2) still holds. 


Here m := \/£,(n) is far from being the best possible. The sharp order 
of magnitude is c log n. Let m = m(n,¢) be the largest integer such that, if 
e(Gn) > e(Tnp) + en, then G, contains the regular (p + 1)—partite graph 
Kp+4i(m,m,...,m). One can ask how large is m = m(n,¢), defined above? 
This was determined by Bollobdés, Erdés, Simonovits [17, 19] and Chvatal 
and Szemerédi [39]. 

The first breakthrough was that the p-times iterated log was replaced by 
K log n, where K is a constant [17]. In the next two steps the dependence of 
this constant on p and € were determined. 


Theorem 3.2 (Bollobds—Erdés—Simonovits [19]). There exists an ab- 
solute constant c > 0 such that every G,, with 


«v2 (-44)() 


contains a K,41(m,m,...,m) with 
clog n 
oe 
p log(1/e) 
The next improvement, essentially settling the problem completely is the 


result of Chvatal and Szemerédi, providing the exact dependence on all the 
parameters, up to an absolute constant. 


Theorem 3.3 (Chvatal—Szemerédi [39]). 


log n emit < clog n 
————— < m(n ——_.. 
500 log(1/e) ; log(1/e) 
One could have thought that the problem is settled but here is a nice 
result of Bollobas and Kohayakawa, improving Theorem 3.3. 


Conjecture 3.4 (Bollobas—Kohayakawa [20]). There exists an absolute 
constant a > 0 such that for allr > 1 and 0 < ¢ < 1/r every G,, of sufficiently 


large order satisfying 
1 n 
n = 1 = 
e(Gn) > ( ste] & 
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contains a K,.41(s0,™mo,---,7™o), where 
= | alogn _ _ |alogn 
so = 80(n) = aa and mo = mo(n) = log i: | 7 
Bollobds and Kohayakawa [20] succeded in proving that under the above 

conditions, if 0 < y < 1, then G, contains a K,+41(s1,7m1,...,71,4), where 

1-y)1 1-— 7)1 
s,= ca q) eee F Mm, = Otte es , and l= laett 1/27 |, 

r log(1/e) log r 


Observe that this result is fairly close to proving Conjecture 3.4: the first 
class is slightly smaller and the last class much larger than in the conjecture. 


Remark [N] 3. In some sense the most important consequence of the 
Chvatal-Szemerédi theorem was that this was the point where the Szemerédi 
Regularity Lemma [169] was really formulated. In the earlier cases, e.g., in 
[168], only weaker, more involved versions of it were used.® 


The Kovari—V. T. Sés—Turan Theorem 


The Kévari-T. Sés—Turadn theorem [123] solves the extremal graph problem of 
Ko2(p,q), at least, provides an upper bound, which in some cases proved to be 
sharp./° This theorem is on the one hand a generalization of the C4-theorem, 
since Cy = K (2,2), and, on the other hand, is a special case of the Erdés— 
Stone theorem, apart from the fact that here we get sharper estimates.!! 


Theorem 3.5 (K6vari-T. Sés—Turadn). Let 2 < p < q be fixed integers. 
Then 


1 
ex(n, K(p,q)) < 5 /q— In"? + O(n). 
Conjecture 3.6. The exponent 2 — (1/p) is sharp: for every p,q > 2, 
ex(n, K(p,q)) > Cpt 


° Szemerédi in [168] used what I call either bipartite or “ugly” regularity lemma. 
In some places, like [120], I stated that this bipartite version was used in [170], 
however, that is not quite correct: a simpler version was enough there. 

1° A footnote of [123] tells us that the authors have received a letter from Erdés 
in which Erdés informed them that he also had proved most of the results of [123]. 


1" Estimates, sharper than in the original Erdés—Stone. 
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Unfortunately, this is known only for p = 2 and p = 3, (see Erdés, Rényi, 
V. T. Sés [81] and independently W. G. Brown [27]). Random graph methods 
[92] show that 


ex(n, K(p,p)) > epn? FT. 
Recently Fiiredi [104] improved the constant in the upper bound, showing 
that 
T= 
ex(n, K(2,t+1)) = 5 Vin? + O(n4/3), 


and that the constant provided by Brown’s construction is sharp. While 
one conjectures that ex(n, K(4,4))/n7/4 converges to a positive limit, we 
know only, by the Brown construction, that ex(n, K(4, 4)) > ex(n, K(3,3)) > 
en®/3, Tt is unknown if 
ex(n, K(4, 4)) 
no/3 
Remark [N] 4 (New constructions). It was a surprising beakthrough, 


when Kollar, Rényai and Tibor Szab6 — using Commutative Algebra — showed 
[219] that 


ex(n, K(p,q)) > éagn' VP), if q> pl, 
proving Conjecture 3.6 for infinitely many cases. This was improved to 
ex(n, K(p,q)) > Cp,grt O/P), if q>(p—1)!, 


by Alon, Rényai and Tibor Szabé, see [294]; an even lower bound is given by 
Ball and Pepe, see [8]. 


Further sources to read: the surveys of Alon [2], Simonovits [159], Fiiredi 
and Simonovits [188]. 


The Matrix Form 


The problem of Zarankiewicz [181] is to determine the maximum integer k,(n) 
such that if A, is a matrix with n rows and n columns consisting exclusively 
of 0’s and 1’s, and the number of 1’s is at least k,(n), then A, contains a 
minor B, of p rows and columns so that all the entries of By are 1’s. 

One can easily see that this problem is equivalent with determining 
the maximum number of edges a bipartite graph G(n,n) can have without 
containing K(p, p). 

In [123] the authors remark that the problem can be generalized to the 
case of general matrices: when A has m rows and n columns and B has p rows 
and g columns. Denote the maximum by k(m, n, p,q). There are many results 
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on estimating this function but we shall not go here into details. Rather, we 
explain the notion of symmetric and asymmetric bipartite graph problems. 
As Erdés pointed out, 


Theorem 3.7. Every graph G,, has a bipartite subgraph H(U,V) in which 
each vertex has at least half of its original degree: d(x) > $dg(x), and 
therefore e(H(U,V)) > $e(G,). 


One important consequence of this almost trivial fact (proved later) is 
that (as to the order of magnitude), it does not matter if we optimize 
e(G,,) over all graphs or only over the bipartite graphs. Another important 
consequence is that some matrix extremal problems are equivalent to graph 
extremal problems. Conversely, many extremal graph problems with bipartite 
excluded subgraphs have equivalent matrix forms as well: 

As usually, having a bipartite graph, G(U, V) we shall associate with it a 
matrix A, where the rows correspond to U, the columns to V and ay; = 1 if 
the 7th element of U is joined to the jth element of V, otherwise a;; = 0. 

Given a bipartite graph L = L(X,Y) and another bipartite graph 
G(U,V), |U| = m and |V| = n, take the m x n adjacency matrix A of 
G and the adjacency-matrix B of L. Assume for a second that the colour- 
classes of L are symmetric (in the sense that there is an automorphism of 
L exchanging the two colour-classes). Then the condition that L Z G(U,V) 
can be formulated by saying that the matrix A has no submatrix equivalent 
to B, where equivalency means that they are the same apart from some 
permutation of the rows and columns. So Turan type problems lead to 
problems of the following forms: 


Given an m x n 0—1 matrix, how many 1’s ensure a submatrix equivalent 
to B? 


If, on the other hand, the forbidden graph L = L(X,Y) has no 
automorphism exchanging X and Y, then the matrix-problem and the graph- 
problem may slightly differ. Excluding the submatrices equivalent to B means 
that we exclude that G(U,V) contains an L with X C U and Y C V, but we 
do not exclude L C G(U,V) in the opposite position. Denote by ex*(n, L) the 
maximum in this asymmetric case. Clearly, ex*(n, L) > ex(n, L), and they 
are equal if L has a colour-swapping automorphism. 


Conjecture 3.8 (Simonovits). If L is bipartite, then ex*(n, L) =O(ex(n, L)). 


We do not know this even for K (4,5). The difficulty in disproving such 
a conjecture is partly that in all the proofs of upper bounds on degenerate 
extremal graph problems, we use only “one-sided” exclusion. Therefore the 
upper bounds we know are always upper bounds on ex*(n, L). 


Conjecture 3.9 (Erdés—Simonovits). For every £ with a bipartite L € £ 
there is a bipartite L* € £ for which ex(n, £) = O(ex(n, L*)). 
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Remark [N] 5. Certain unpublished result of Faudree and Simonovits [253] 
indicate that perhaps this conjecture does not always hold. 


We close this part with a beautiful but probably very difficult problem of 
Erdos. 

Conjecture 3.10. ex(n, {C3,C4}) = san? + o(n3/2), 

The meaning of this conjecture is that excluding C3 beside Cy has the 
same effect as if we excluded all the odd cycles. If we replace C3 by Cs, then 
this is true, see [87]. Erdés risks the even sharper conjecture that the exact 
equality may hold: 


ex(n, {C3, C4}) = ex(n, {C3, C4, Cs, Cy, Co, Cy sas \). 


Remark [N] 6. Some related results can be found in the paper of Keevash, 
Sudakov, and Verstraete [279]: they prove the above conjecture for several 
cases. (See also [209].) 


Further sources to read: For some further information, see a survey paper 
of Richard Guy [106] and also a paper of Guy and Znam [107] on K (p,q) and 
the results of Lazebnik, Ustimenko and Woldar on cycles [124, 125]. 

Also, the problems, posing the problems was a very characteristic feature 
of Paul’s mathematics. Here we neglect this aspect a little. The reader is 
referred to the many problem-posing papers of Erdés, and also to the book 
of Chung and Graham [185]. See also [188]. 


Applications of Ké6vari—T. Sés—Turan Theorem 


It is interesting to observe that the C4-theorem and its immediate generaliza- 
tions (e.g. the Kévari-T. Sés—Turan theorem) have quite a few applications. 
Some of them are in geometry. For example, as Erdés observed, if we have n 
points in the plane, and join two of them if their distance is exactly 1, then the 
resulting graph contains no K (2,3). So the number of unit distances among 
n points of the plane is O(n°/ ?). Similarly, the unit-distance-graph of the 3- 
dimensional space contains no K (3,3), therefore the number of unit distances 
in the 3-space is O(n°/ 3). There are deeper and sharper estimates on this 
subject, see Spencer, Szemerédi and Trotter [164] or Clarkson, Edelsbrunner, 
Guibas, Sharir and Welz] [40]. 


Conjecture 3.11 (Erdés). For every « > 0, the number of unit distances 
among n points of the plain is O(n!**). 


We mention one further application: the chromatic number of the product 
hypergraph. Claude Berge was interested in calculating the chromatic number 
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of the product of two graphs, H and H’. Generally there are various ways to 
define the product of r—uniform hypergraphs. This product is defined as the 
r?—uniform hypergraph whose vertex-set is the Cartesian product V(H) x 
V(H’) and the edge-set is 


{Hx H': HE E(H) and H’€ E(H’)}. 


The chromatic number of the hypergraph is at most k if the vertices can be 
coloured in k colours without having monochromatic r?-tuples. Berge and 
I [10] estimated the chromatic number of products of graphs (hypergraphs) 
using Kévari-T. Sés—Turdn theorem. Also we found a matching lower bound 
by using basically the Brown and Erdés—Rényi-T. Sés constructions [27, 81]. 
The same time a student of Berge, F. Sterboul [165, 166] have proved the 
same theorem and an earlier paper of V. Chvatal [38] used the same technique 
to prove some assertions roughly equivalent with this part of our paper [10]. 


Remark [N] 7. It is an interesting feature of the Berge-Simonovits paper 
that — connected to the above topics — the paper also defines the fractional 
chromatic number. 


4. Graph Theory and Probability 


Erdés wrote two papers with the above title, one in 1959, [48], and the other 
in 1961, [49]. These papers were of great importance. In the first one Erdés 
proved the following theorem. 


Theorem 4.1. For fixed k and sufficiently large @, if n > €'+1/@*), then 
there exist (many) graphs G., of girth k and independence number a(G,) < &. 


Clearly, y(Gn) > eee. So, as Erdés points out, a corollary of this is 


Corollary 4.2. For every integer k for n > no(k) there exist graphs G,, of 
girth > k and chromatic number > nieet, 
This theorem seems to be a purely Ramsey theoretical result, fairly 


surprising in those days, but, it has many important consequences in 
Extremal Graph Theory as well. The same holds for the next theorem, too: 


Theorem 4.3 ([49]). Assume that n > no. Then there exist graphs G, with 
K3 ZG, and a(Gp) = O(,/n log n). 


One important corollary of Theorem 4.1, more precisely, of its proof is 
that 


Theorem 4.4. If £ is finite and contains no trees, then ex(n,£) > cent, 
for some constants c7Z,cc > 0. 


On the other hand, it is easy to see that if LZ € £ is a tree (or a forest), 
then ex(n, £) = O(n). 
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These theorems use random graph methods. They and some of their 
generalizations play also important role, in obtaining lower bounds in Turan— 
Ramsey Theorems. (See also Fiiredi-Seress [105].) For general applications 
of the probabilistic methods in graph theory see, e.g. Erdés—Spencer [92], 
Bollobas [15], Alon—Spencer [7]. 

Of course, speaking of Graph Theory and Probability, one should also 
mention the papers of Erdés and Rényi, perhaps above all, [80}.!” 


Further sources to read: Janson, Luczak, Rucitiski [190], Molloy and Reed 
[195]. 


5. The General Theory 


In this section, we present the asymptotic solution to the general extremal 
problem. 


General Extremal Problem. Given a family £ of forbidden subgraphs, find 
those graphs G,, that contain no subgraph from £ and have the maximum 
number of edges. 


The problem is considered to be “completely solved” if all the extremal 
graphs have been found, at least for n > no(L). Quite often this is too 
difficult, and we must be content with finding ex(n,£), or at least good 
bounds for it. 

It turns out that a parameter related to the chromatic number plays a 
decisive role in many extremal graph theorems. The subchromatic number 
p(L) of L is defined by 


p(L) = min{x(L): Le L}-1. 


The following result is an easy consequence of the Erdés—Stone theorem 
[93]: 


Theorem 5.1 (The Erdés—Simonovits Theorem [84]). If L is a family 
of graphs with subchromatic number p, then 


aS (1 7 | (5) + o(n?). 


The meaning of this theorem is that ex(n, £) depends only very loosely 
on £L; up to an error term of order 0(n), it is already determined by the 
minimum chromatic number of the graphs in CL. 


” They are reprinted in [60, 196] 
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Classification of Extremal Graph Problems 


By the above theorem, 
ex(n, £L) = 0(n”) 


if and only if p(£) = 1, ie. there exist bipartite graphs in £. From the 
Kévari-T. Sés-Turdn Theorem we get that here ex(n, £) = O(n?~°) for some 
c = c(L). We shall call these cases degenerate extremal graph problems and 
find them among the most interesting problems in extremal graph theory. 
One special case is when £ contains a tree (or a forest). These cases could 
be called very degenerate. Observe, that if a problem is non-degenerate, then 


n2 


Tp,2 contains no excluded subgraphs. Therefore ex(n, £) > Ea : 


Structural Results 


The structure of the extremal graphs is also almost determined by p(L), and 
is very similar to that of JT), This is expressed by the following results of 
Erdés and Simonovits [55, 57, 152]: 


Theorem 5.2 (The Asymptotic Structure Theorem). Let £ be a family of 
forbidden graphs with subchromatic number p. If $;, is any graph in EX(n, L), 
then it can be obtained from T,,,, by deleting and adding o(n”) edges. 
Furthermore, if £ is finite, then the minimum degree 


diel Sa= (1 2 ~) DhaGe: 


The structure of extremal graphs is fairly stable, in the sense that the 
almost-extremal graphs have almost the same structure as the extremal 
graphs (for £ or for K,+1). This is expressed in our next result: 


Theorem 5.3 (The First Stability Theorem). Let £ be a family of forbidden 
graphs with subchromatic number p > 2. For every € > 0, there exist a 6 > 0 
and n- such that, if G, contains no L € £, and if, for n > ne, 


e(Gn) > ex(n, £L) — 6n?, 
then G;, can be obtained from T;,,, by changing at most en edges. 


These theorems are interesting on their own and also widely applicable. 

In the remainder of this section we formulate a sharper variant of 
the stability theorem. One can ask whether further information on the 
structure of forbidden subgraphs yields better bounds on ex(n, £) and further 
information on the structure of extremal graphs. At this point, we need a 
definition. 


264 Mikl6és Simonovits 


Definition 5.4. Let £ be a family of forbidden subgraphs, and let p = p(L) 
be its subchromatic number. The decomposition M of £ is the family of 
graphs M with the property that, for some L € L, L contains M as an 
induced subgraph and L — V(M) is (p — 1)-colorable. 


In other words, for r = v(L), L C M x Kp-1(r,...,7), and M is minimal 
with this property. In case of £ = {K,} the family M consists of one graph 
K 2. The following result is due to Simonovits [152], (see also [57]). 


Theorem 5.5 (The Decomposition Theorem, [152]). Let £ be a for- 
bidden family of graphs with p(£) = p and decomposition M. Then 
every extremal graph S, € EX(n,£) can be obtained from a suitable 
K,(m1,-..,2p) by changing O(ex(n,M) + n) edges. Furthermore, nj = 
(n/p) + O(ex(n, M)/n) + O(1), and 


dmin(Sn) = (1 - | n+ O(ex(n, M)/n) + O(1). 

It follows from this theorem that, with m = [n/p], ex(n, £) = e(Tnp) + 
O(ex(m, M) + n). If ex(n, M) > en, then O(ex(m, M)) is sharp: put edges 
into the first class of a T;, , so that they form a G,», € EX(m, M); the resulting 
graph contains no L, and has e(T,,,,) + ex(m, M) edges. 

A second stability theorem can be established using the methods of 
[152]. To formulate it, we introduce some new terms. Consider a partition 
U,,U2,...,U, of the vertex-set of G;,, and the complete p—partite graph H, = 
K(uy,...,Up) corresponding to this partition of V(G,), where u; = |U;|. A 
pair of vertices is called an eztra edge if it is in G, but not in H,, and isa 
missing edge if it is in H,, but not in G,. For given p and G,, the partition 
U,,U2,...,Up is called optimal if the number of missing edges is minimum. 
Finally, for a given vertex v, let a(v) and b(v) denote the numbers of missing 
and extra edges at v, respectively. 


Theorem 5.6 (The Second Stability Theorem). Let £ be a forbidden family 
of graphs with p(£) = p and decomposition M, and let k > 0. Suppose that 
G,, contains no L € £, 


e(G,) > ex(n,£L) — k- ex(n, M), 
and let U;,...,U, be the optimal partition of G,, G; := G(U;). Then 


(i) G, can be obtained from xG; by deleting O(ex(n, M) +n) edges; 

(ii) e(G;) = O(ex(n, M)) + O(n), and |U;| = (n/p) + O(p/ex(n, M) + Vn); 

(iii) For any constant c > 0, the number of vertices v in G; with a(v) > 
cn is only O(1), and the number of vertices with b(v) > cn is only 
O(ex(n, M)/n) + O(1); 

(iv) Let L € L, with v(L) = r, and let A; be the set of vertices v in Uj 
for which b(v) < (n/2pr); if M x Ky-1(71,...,r) D L, then the graph 
G(A;) contains no M. 
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The constant k of the condition cannot be seen in (i)—(iv): it is hidden in 
the constants of the O(.)’s This theorem is useful also in applications. The 
deepest part is the first part of (iii). This implies (iv), which in turn implies 
all the other statements. A proof is sketched in [155], where the theorem was 
needed. 

We conclude this section with the theorem characterizing those cases 
where Ty,» is the extremal graph. 


Theorem 5.7 (Simonovits [155]). The following statements are equiva- 
lent: 


(a) The minimum chromatic number in £ is p+ 1 but there exists (at least 
one) L € £ with an edge e such that y(L—e) = p. (Colour critical edge.) 

(b) There exists an no such that for n > no(L), Th» is extremal. 

(c) There exists an n; such that forn > ni(L), Th p is the only extremal 
graph. 


Remark [N] 8. Here we can observe an important phenomenon, namely, 
that (mostly) if we can prove some theorems for complete graphs K,+1, then 
we can also prove it for cases, where a p+ 1-chromatic L € CL has a critical 
edge. 

We shall see this phenomenon in the Kolaitis-Proémel—Rothschild theo- 
rem, and in many other cases. 


The Product Conjecture 


When I started working in extremal graph theory, I formulated (and later 
slightly modified) a conjecture on the structure of extremal graphs in non- 
degenerate cases (i.e., when no excluded graph L is bipartite). The meaning 
of this conjecture is that all the non-degenerate extremal graph problems can 
be reduced to degenerate extremal graph problems. 


Conjecture 5.8 (Product structure). Let £ be a family of forbidden 
graphs and M be the decomposition family of L. If no trees and forests occur 
in M, then all the extremal graphs S,, for £ have the following structure: 
V(S,,) can be partitioned into p = p(L) subsets Vi,...,Vp so that V; is 
completely joined to V; for every 1< i<j <p. 


This implies that each S, is the product of p graphs G;, where each 
G;, is extremal for some degenerate family £L;,, D M. The meaning of this 
conjecture is that (almost) all the non-degenerate extremal graph problems 
can be reduced to degenerate extremal graph problems. 

One non-trivial illustration of this conjecture is the Octahedron theorem: 


Theorem 5.9 (Erdés—Simonovits [85]). Let Og = K3(2,2,2) (ie. Og is 
the graph defined by the vertices and edges of the octahedron.) If S,, is an 
extremal graph for Og for n > no(Og¢), then S, = Hm ® Hn—m for some 
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m= $n + o(n). Further, Hm is an extremal graph for Cy and Hn» is 
extremal for P3. 


Remark 5.10. The last sentence of this theorem is an easy consequence of 
that S,, is the product of two other graphs of roughly the same size. 


Remark 5.11. In [85] some generalizations of the above theorem can also 
be found. Thus e.g., the analogous product result holds for all the forbidden 
graphs L = Kp41(2,te,...,tp) and L = K,41(3,te,...,tp). Instead of 
formulating the general result we just give an illustration. If e.g., we try to 
apply this to L = K(3,a,b,c), where 3 << a<b<c, then any S, € EX(n, L) 
will be the product of three graphs, where the first one is K (3, a)-extremal, 
the other two are K(1,b)-extremal (i.e. almost (b — 1)-regular). 


Probably the octahedron theorem can be extended to all graphs 
L = Ky41(ti,te,...,tp) and even to more general cases, see [330]. On 
the other hand, in [157] counterexamples are constructed to the product- 
conjecture if we allow trees or forests in the decomposition family. In this case, 
when the decomposition contains trees, both cases can occur: the extremal 
graphs may be non-products and also they may be products. Turan’s theorem 
itself is a product-case, where the decomposition family contains Kz = Py. 


Szemerédi Lemma on Regular Partitions of Graphs 


There are many important tools in Extremal Graph Theory that became 
quite standard to use over the last 20 years. One of them is the Szemerédi 
Regularity Lemma [169]. 

Let G be an arbitrary graph, X,Y Cc V(G) be two disjoint vertex-sets 
and let d(X,Y) denote the edge-density between them: 


Regularity lemma ([169]). For every « > 0, and every integer & there 
exists a ko(e,«%) such that for every G,, we can partition V(G,) into sets 
Vo, Vi,.--, Ve — for some kn <k < ko(e,%) — so that |Vo| < en, each |V;| =m 
for i > 0 and for all but at most €- i) pairs (i,7), for every X CV; and 
Y CV,, satisfying |X|,|Y| > em, we have 


ld(X,Y) - (Vi, Vi) <e. 


The applications of Szemerédi’s Regularity Lemma are plentiful and are 
explained in details in [120], so here we shall describe it only very briefly. 

One feature of the Regularity Lemma is that — in some sense — it allows 
us to handle a deterministic graph as if it were a (generalized) random one. 
One can easily prove for random graphs the existence of various subgraphs 
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and the Regularity Lemma often helps us to ensure the existence of the same 
subgraphs when otherwise that would be far from trivial. 

One example of this is the Erdés—Stone Theorem. Knowing Turdn’s 
theorem, the Szemerédi Lemma immediately implies the Erdés—Stone the- 
orem. In the previous section we have mentioned a few improvements of the 
original Erdds—Stone theorem. The proof of the Chvatal-Szemerédi version 
[39] also uses the Regularity Lemma as its main tool. Joining to the work of 
Thomason [171, 172], Fan Chung, Graham, Wilson [37, 36] and others, V. 
T. Sds and I used the Regularity Lemma to give a transparent description 
of the so-called quasi-random graph sequences [160], that was generalized by 
Fan K. Chung to hypergraphs [35]. 


Remark [N] 9. Actually, the Regularity Lemma described in [35] was 
a “weak hypergraph regularity lemma”. Weak, because it could not be 
connected to an appropriate Counting Lemma. This lemma was obtained 
slightly earlier by Frankl and Rédl [99], but published slightly after [35]. 
Later a Strong Hypergraph Lemma was established by Rédl and his school, 
among others, Nagle [303], Skokan [323, 324], Schacht [319, 320], directly 
connected to a Counting Lemma and also to the Removal Lemma. Tim 
Gowers also needed and established (a) strong hypergraph lemma [263]. See 
also the results of Tao on this subject [334, 335, 336, 337]. 

These results are much more complicated than the weak regularity lemma. 
Just to formulate them takes a lot of time and energy. Here we skip the details. 

The main advantage of Chung’s Hypergraph Regularity Lemma [35] was 
that it helped to extend the Simonovits-Sés results of [160] to hypergraphs. 

Here I mention only that it was extended to hypergraphs by Frankl and 
Rédl [99], see also Chung [35]. Prémel and Steger also use a hypergraph 
version of the Regularity lemma in “induced extremal graph problems” [143]. 


Further sources to read: Diestel’s book [187] also contains a classical 
proof of the regularity lemma. 

Frieze and Kannan produced a Weak Regularity Lemma [257, 258, 259], 
that was very applicable in algorithms, and iterating it we got an other proof 
of the Szemerédi Regularity Lemma. 

Lovadsz and Szegedy [296] followed a similar approach, however, used 
limits of graphs and analysis. 

See also the Theorem and the Removal Lemma of Ruzsa and Szemerédi, 
answering a question of [239], [147], Solymosi [333], Conlon and Fox [242], 
G. Elek and B. Szegedy [247] and many others. 

Rédl and Schacht, setting out from some results of Alon and Shapira, 
extend results on the connection of Property Testing and Regularity Lemma 
to hypergraphs [322]. 
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The (ordinary graph) Regularity Lemma can be generalized in various ways. 
One of these generalizations states that if the edges of G, are r—coloured 
for some fixed r, then we can partition the vertices of the graph so that the 
above Regularity Lemma remains true in all the colours simultaneously. This 
is what we used among others in proving some Turan—Ramsey type theorems 
[72, 73, 74] but it has also many other applications. 


Generalized Regularity lemma. For every ¢ > 0, and integers r, there 
exists a ko(e,%,r) such that for every graph G,, the edges of which are r—- 
coloured, the vertex set V(G,,) can be partitioned into sets Vo,Vi,...,Vke — 
for some & <k < ko(e,«,1r) — so that |Vo| < en, |Vi| =m (is the same) for 
every 1 > 0, and for all but at most e(8) pairs (i,7), for every X CV; and 
Y CV; satisfying |X|,|Y| > em, we have 


\d,(X,Y) —d,(Vi,V;)|<e simultaneously for v=1,...,r, 
where d, (X,Y) is the edge-density in colour v. 


As I mentioned, we describe the various applications of Szemerédi Lemma 
in more details in some other places [120, 292]. 

Algorithmic versions were found by Alon, Duke, Lefmann, Rédl, and 
Yuster [5], and in some sense it was extended to sparse graphs by Kohayakawa 
[119], and Kohayakawa-Rédl [192]. 

I should also mention some new variants due to Komlds, see, for 
example, [120]. 


Remark [N] 10 (Historical remarks). Actually, most of the mathemati- 
cians may think that Szemerédi invented the Regularity Lemma to prove his 
famous result (the Erdés-Turdn conjecture) on the existence of arithmetic 
progressions in sequences of integers of positive density. The truth is that 
the first applications used a less appealing version of the Regularity Lemma, 
or some simpler version. The first occasion where the Szemerédi Regularity 
Lemma was used is the Quantitative Erdés-Stone Theorem of Chvatal and 
Szemerédi [39], (see above as Theorem 3.3). 


Remark [N] 11 (Regularity Lemma, Old and New Areas). Let us 
return to the field of ordinary regularity lemma. If one is interested in it, 
there is a long list of papers that are reasonably easy to read and introduce 
to various aspects of the Regularity Lemma. 


1. Blow Up Lemmas: There are cases, when we wish to embed a spanning 
subgraph into G,,, and it would be easy to embed n — o(n) vertices, but 
we may have trouble with embedding the last few vertices. The Blow- 
Up Lemma was “invented” for this purpose, by Komldés, Sarkdézy and 
Szemerédi, see [289, 290], see also Rédl and Rucitiski [318], 

2. Algorithmic Aspects of Regularity Lemmas [5], ... 
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3. Sparse Regularity Lemmas and their applications [192, 284, 285, 286, 287, 
288], 

4. Regularity Lemma and Property Testing: Alon and Shapira [221, 220], 
Alon, Fischer, Newman, and Shapira [216], Lovasz and Szegedy [297, 298]. 
(An early result on this topic, slightly disguised, can be found in Bollobas— 
Erdés—-Simonovits—Szemerédi [234].) For a “Strong” Regularity Lemma 
see Alon, Fischer, Krivelevich and M. Szegedy [4]. 


6. Turan—Ramsey Problems 


Simonovits and V. T. Sds has a survey [331] on Turan—Ramsey type theorems 
initiated by V. T. Sds, and the connection of this to other fields. These fields 
belong to Extremal Graph Theory and are strongly influenced by Paul Erdés. 
I will touch on these fields only very briefly. 

These problems were partly motivated by applications of graph theory 
to distance distribution. Turdn theorem combined with some geometrical 
facts can provide us with estimates on the number of short distances in 
various geometrical situations. Thus they can be applied in some estimates 
in analysis, probability theory, and so on. It was Erdos who first pointed out 
this possibility of applying Graph Theory to distance distribution theorems 
[47] and later Turdn in [176] initiated investigating these problems more 
systematically. This work culminated in three joint papers of Erdés, Meir, 
Sés and Turan [76, 77, 78].1° 

The structure of the extremal graphs in Turan type theorems seems to 
be too regular. So we arrive at the question: How do the upper bounds in 
extremal graph theorems improve if we exclude graphs very similar to the 
Turan graphs? Basically this was what motivated V. T. Sés [162] in initiating 
a new field of investigation. Erdés joined her and they have proved quite a 
few nice results, see e.g., [89, 90, 91]. 

Let a,(G) denote the maximum cardinality of vertices in G such that the 
subgraph spanned by these vertices contains no K;. 


General Problem. Assume that L1,...,L, are given graphs, and Gy, is a 
graph on n vertices the edges of which are coloured by r colours x1,..., Xr; 
and 


for vy =1,...,r the subgraph of colour x, contains no L, 
and ap,(Gn) <m. 
What is the maximum of e(G,,) under these conditions? 


Originally the general problem was investigated only for p = 2,!4 and one 
breakthrough was the Szemerédi—Bollobas—Erdés theorem: 


'3 See also Corrigendum to [76]. 


4 This means that in all the papers quoted but [74] we restrict ourselves to 
ordinary independent sets. 
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Theorem 6.1 (Szemerédi [170]). If (G,) is a sequence of graphs not 
containing A, and the stability number a(G,,) = o(n), then 


ajz an Ge), (3) 


Erdés asks if the o(n?) error term is necessary: Is it true that in 
Theorem 6.1 the stronger 


also holds? 
Theorem 6.2 (Bollobds—Erdés [18]). Equation (3) is sharp. 


Many estimates concerning general and various special cases of this field 
are proved in [72, 73, 74]. Here we mention just one result: 


Theorem 6.3 (Erdés—Hajnal—Simonovits—Sés—Szemerédi [74]). 


(a) For any integers p > 1 and q > p if ap(Gn) = o(n) and K, Z Gn, then 


(b) For g = pk +1 this upper bound is sharp. 


One of the most intriguing open problems of the field is (among many 
other very interesting questions) 


Problem 6.4. Assume that (G,,) is a sequence of graphs not containing 
K (2, 2,2). If a(G,,) = o(n), does it follow that e(G,,) = o(n”)? 


I conclude this section with a slightly different result of Ajtai, Erdés, 
Komlds and Szemerédi. Let t be the average degree of G,,. Turdn’s theorem 
n 


guarantees an independent set of size 7}. 


Theorem 6.5 ([1]). There exists a contant c > 0 for which, if the number 
of K3 C Gy is o(n?), then 


2e(Gn 
a(Gn) > eX log t for t= wes ) 


The nice feature of the above theorem is that it says: if the number of 
triangles in G,, is o(n®), then the size of the maximum independent set jumps 
by a log-factor. This is sharp: 7 log t is achieved for random graphs. 

Theorem 6.5 can also be interpreted as follows: excluding the triangles (or 
assuming that there are only few triangles in our graph) leads to randomlike 
behaviour. (See also [206, 207].) 


Remark [N] 12 (Some newer results.). Sudakov proved that if a(G,,) is 
“slightly smaller than” e°V!°8", and Ky Z Gn, then e(G,) = o(n?) [332]. 
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Jacob Fox, Po-Shen Loh and Y. Zhao [254] described very precisely the 


phase-transition when G,, does not contain K4 and e(G,) is nearly =n?. 


Jézsef Balogh and Lenz found very nice new constructions to obtain good 
lower bounds for several cases, [227, 228] 

Balogh-Ping-Simonovits [229] investigated, when does e(G,) drastically 
drops, as we decrease a(G,,) “continuously” from (7) to 2. 


Remark [N] 13 (A minor correction). In one of the proofs of [74] we 
(Erdés, Hajnal, Simonovits, Sés, and Szemerédi) used an isoperimetric 
theorem from [232], which turned out to be false. The original proof could 
not be saved. However, Balogh and Lenz [227] saved the essential part of our 
result and extended it to several other cases. For details, see [227]. 


7. Cycles in Graphs 


Cycles play central role in graph theory. Many results provide conditions to 
ensure the existence of some cycles in graphs. Among others, the theory of 
Hamiltonian cycles (and paths) constitute an important part of graph theory. 
The Handbook of Combinatorics contains a chapter by A. Bondy [23] giving 
a lot of information on ensuring cycles via various types of conditions. Also, 
the book of Walther and Voss [179] and the book of Voss [178] contain many 
relevant results. Below we shall approach the theory of degenerate extremal 
graph problems (see Sect. 8) through extremal graph problems with forbidden 
cycles. Of course, one of the simplest extremal graph problems is when C is 
the family of all cycles. If we exclude them, the considered graphs will be all 
the trees and forests; the extremal graphs are the trees. 


Remark 7.1. Describing walks and cycles in graphs is perhaps one of 
those parts of extremal graph theory, where algebraic methods may come in 
more often than in other extremal problems. So here occasionally, and very 
superficially, I will speak of Margulis graphs, Ramanujan graphs and Cayley 
graphs. I feel, these topics are very important, not only because of expander 
graphs but also because they provide new methods to construct nice graphs 
in extremal graph theory. I warmly recommend Noga Alons’ chapter from the 
Handbook of Combinatorics: Tools from Higher Algebra [2], which provides 
a lot of interesting and useful information — among others — on topics I had 
to describe very shortly. 


Excluding Long Cycles 


One problem posed by Turan was the extremal problem of cycles of length m. 
If we exclude all the odd cycles, the extremal graph will be the Turan graph 
Tn,2- What are the extremal graphs if the family of excluded graphs is the 
family Ly of cycles of length at least m. The answer is given by the Erdés— 
Gallai theorem: 
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Theorem 7.2 (Erdés and Gallai [69]). Let L,, ={C, : k > m}. Then 


(i) ™4n - dm? < ex(n,Lm) < ™*n and 
(ii) The connected graphs G,, whose 2-connected blocks are Ky,,-1’s are 


extremal. 


Graphs described in (ii) do not exist for all n, but we get asymptotically 
extremal graphs for all n, by taking those graphs in which one 2-connected 
component has size at most m — 1 and all the other blocks are complete 
m, — 1-graphs. 

The following theorem is the twin of the previous one’s. 


Theorem 7.3 (Erdés and Gallai [69]). 


m—2 


ex(n, Pin) < n. 


The union of | —"~_| vertex disjoint K,,—1 (and one smaller Ky) shows that 


this is sharp: ex(n, Pn) = “52n + O(m?). 


This theorem has a sharper form, proved by Faudree and Schelp [95]. 
They needed the sharper form to prove some Ramsey theorems on paths. 
(See also Kopylov [293].) 

These theorems can also be used to deduce the existence of Hamilton 
paths and cycles. Thus, for example, Theorem 7.2 implies Dirac’s famous 
result: 


Theorem 7.4 (Dirac). If the minimum degree of G2, is at least k, then 
Go, is Hamiltonian. 


7.1. Excluding Fixed Trees 


Erd6s and T. Sdés observed that the same estimates hold both for the path 
Py and the star K2(1,m-—1) and these being two extremes among the trees 
of m vertices, they conjectured that [54]: 


Conjecture 7.5 (Erdés—T. Sés). For any tree Tn, 
m—2 
2 


Of course, this implies ex(n, Tm) < “52n+ O(1). Some asymptotical ap- 
proximations of this conjecture were proved by Ajtai, Komlds and Szemerédi, 
(unpublished), also, the conjecture is proved in its sharp form for some special 
families of trees, like caterpillars, large girth graphs [238], graphs with many 
leaves [327], graphs of small diameters [301], ... 


ex(n, Tim) < 


Remark [N] 14 (Embedding trees into graphs). The above Conjec- 
ture 7.5 is now solved for all suffficiently large trees, by Ajtai, Komlds, 
Simonovits and Szemerédi, though the publishing of this theorem is not yet 
finished [202, 204, 203]. 
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Theorem 7.6 (Ajtai-Komlés-Simonovits-Szemerédi). There exists a 
ko such that for all k > ko the Erdés-S6s conjecture holds. 


There is another conjecture, strongly related to the Erdés-Sés Conjecture, 
namely, the Loebl-Komldés-Sés conjecture. This question has a special form, 
(Loebl) and a general form (Komlds, Sé6s) and the Loebl Conjecture comes 
from a problem of Erdés, Fiiredi, Loebl and Sés [252]. 


Conjecture 7.7 (Komldés-Sés). If G, has at least n/2 vertices of degree 
at least & then it contains all the trees of k + 1 vertices. 


The original problem of Loebl was this Conjecture for n = k. 

This was obtained through a series of partial results by Ajtai-Komlés- 
Szemerédi [208], Yi Zhao [341], Piguet-Stein [309], Cooley-Hladky—Piguet 
[244]... and finally 


Theorem 7.8 (Hladky-Komldés-Piguet-Simonovits-Stein-Szemerédi). 
There exists a kg such that for all k > ko the Komlos—S6s conjecture holds. 


Further sources to read: 


(a) A much more detailed description of this can be found in Simonovits— 
Fiiredi [188], see also Sidorenko [327], Brandt and Dobson [238], and 
Andrew McLennan [301], Saclé and Wozniak [325, 339]. 

(b) As to the Komlés—Sés conjecture, see [273] for the “approximative 
solution” and a longer description of the situation, or [188]. 

(c) We have to remark here that Yi Zhao (a Student of Szemerédi) was 
the first to prove a result in this field, superseding [208], see his PhD 
thesis [340], or his preprint, [341], however, this paper was very difficult 
to read and finally it was superseded by Piguet and Stein [309, 308] and 
Cooley [243]. (See also [274].) 


Excluding Coz 


Since the odd cycles are 3-chromatic colour-critical, one can apply Theo- 
rem 5.7 to them to get 


2 


ex(n, Cop41) = | if n>no(k). 


The case of even cycles is much more fascinating. The upper bound would 
become trivial if we assumed that G,, is (almost) regular and contains no 
cycles of length < 2k. The difficulty comes from that we exclude only Cz. 


Theorem 7.9 (Erdés, Bondy—Simonovits [25]). 


ex(n, Cox) < ckn4¥/* 4 o(ntt/*), 
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Theorem 7.10 (Bondy—Simonovits [25]). If e(Gn) > 100kn'+1/*, then 
Coe CG, for every integer ¢ € [k, kn/*), 


Erdés stated Theorem 7.9 in [54] without proof and conjectured Theo- 
rem 7.10, which we proved. The upper bound on the cycle-length is sharp: 
take a G, which is the union of complete graphs. 

Let us return to Theorem 7.9. Is it sharp? Finite geometrical (and other) 
constructions show that for k = 2,3,5 YES. (Singleton [161], Benson [9], 
Wenger [180] ....) Unfortunately, nobody knows if this is sharp for Cg, or for 
other Co;,’s. 

Faudree and I sharpened Theorem 7.9 in another direction: 


Definition 7.11 (Theta-graph). O0(k, p) is the graph consisting of p vertex- 
independent paths of length & joining two vertices x and y. 


Clearly, O(k,p) is a generalization of Ca,. We have proved 
Theorem 7.12 (Faudree-Simonovits [96]). ex(m, O(k,p)) < cp.pntt/*. 


The Erdés—Rényi Theorem [80] shows that Theorem 7.12 is sharp in the 
sense that 


ex(n, O(k, p)) > cy nitktis as n— oo. 
kp 


One could ask if there are other global ways to state that if a graph has 
many edges then it has many cycles of different length. Erdés and Hajnal 
formulated such a conjecture, which was proved by A. Gydarfds, J. Komldés 
and E. Szemerédi. Among others, they proved 


Theorem 7.13 (Gyarfas-—Komldés—Szemerédi [108]). If dmin(G) > 6 
and ¢1,...,€m are the cycle-lengths of G, then 


1 
DUG 2 clo 6. 


The meaning of this is as follows: If we regard all the graphs with 
minimum degree 6 and try to minimize the sum of the reciprocals of the 
cycle-lengths, two candidates should first be checked. One is the union of 
disjoint K5+41’s, the other is the union of disjoint complete pip graphs 

(6, 6)’s. In the first case we get log 6+-O(1), in the spond one $ log 6+O(1). 
The above theorem asserts that these cases minimize }> + 

Some graph theorists could be surprised by measuring the density of 
cycle lengths this way. Yet, whenever we want to express that something 
is nearly linear, then in number theory we tend to use this measure. Thus, 
e.g. the famous $3,000 problem of Erdés asks for the following sharpening of 
Szemerédi’s theorem on Arithmetic Progressions [168]: 


Conjecture 7.14 (Erdés). Prove that if A= {a < a2 <---} isan infinite 
sequence of positive integers and 
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1 
Th=n 
ay 
then for every k, A contains a k-term arithmetic progression. 


Remark [N] 15. When Erdés posed the above conjecture, he was interested 
among other if 


Q: the set of primes contain arbitrary long arithmetic progressions. 


In the paper of Erdés [250] that appeared in the Birthday Volume “Paul 
Erdos is 80” he mentioned that the longest arithmetic progression of primes 
was of length 17 those days. It was a fantastic breakthrough when Ben Green 
and Terence Tao proved [266] that YES, Q is true. 


Remark [N] 16 (0-click-in). One could weaken Erdés Conjecture by ask- 
ing: Is it true at least, that for every & there is a p = pz for which, for some 
Cr > 0 

ex(n, @(k,p)) > cp-nitV/F? 
One feels that similarly to the Kollar-Rényai-Szabé construction [294] (see 
also Alon-Rényai-Szab6 [219]) one should be able to find a construction 
proving this. J. Verstraéte has some results into this direction [338]. 


Very Long Cycles 


We know that a graph with minimum degree 3 contains a cycle of length at 
most 2 log, n. The other extreme is when (instead of short cycles) we wish to 
ensure very long cycles. We may go much beyond the Erdés—Gallai theorem 
if we increase the connectivity and put an upper bound on the maximum 
degree. 


Theorem 7.15 (Bondy—Entringer [24]). Let f(n,d) be the largest inte- 
ger k& such that every 2—connected G,, with maximum degree d contains a 
cycle of length at least k. Then 

4 log, _,n—4logy_,logg_,n — 20 < f(n,d) < 4logy_,n+4. 


Clearly, the connectivity is needed, otherwise — as we have seen — the 
Erdés—Gallai theorem is sharp. They also considered the case of d-regular 
graphs, proving that 


Theorem 7.16 (Bondy—Entringer [24]). If G, is 2-connected and d- 
regular, then G, contains a cycle of length at least 


max{2d, 4 logy_,n—4 logy_, logg_; — 20}. 


Bondy and I proved that increasing the connectivity leads to a steap 
jump: 
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Theorem 7.17 (Bondy—Simonovits [26]). If G,, is 3-connected and the 
minimum degree of G,, is d, the maximum degree is D, then G, contains a 
cycle of length at least e¢V!°2 " for some c = c(d, D). 


We conjectured that e°v!°? " can be improved to n°. Bill Jackson, Jackson 
and Wormald succeded in proving this: 


Theorem 7.18 (B. Jackson [112, 115]). If Gn is 3-connected and the 
minimum degree of G,, is d, the maximum degree is D, then G,, contains a 
cycle of length at least n° for some c = c(d, D). 


Increasing the connectivity higher does not help in getting longer cycles: 


Theorem 7.19 (Jackson, Parson [113]). For every d > 0 there are 
infinitely many d+ 2-regular d-connected graphs without cycles longer than 
n? for some y = Ya < 1. 


See also [114]. We close this topic with an open problem: 


Conjecture 7.20 (J. A. Bondy). There exists a constant c > 0, such that 
every cyclically 4-connected 3-regular graph G', contains a cycle of length at 
least cn. 


Erd6s—Pé6sa Theorem 


The following question of Gallai is motivated partly by Menger Theorem. If 
G is a graph 
(*) not containing two independent cycles, 


how many vertices are needed to represent all the cycles of G? 

Ks satisfies (x) and we need at least 3 vertices to represent all its cycles. 
Bollobas [11] proved that in all the graphs satisfying (*) there exist 3 vertices 
the deletion of which results in a tree (or forest). More generally, 

Let RC(k) denote the minimum ¢ such that if a graph G contains no k+1 


independent cycles, then one can delete ¢t vertices of G ruining all the 
cycles of the graph. Determine RC(k)! 


Erdés and Pésa [79] proved the existence of two positive constants, c) 
and ce such that 


cik log k < RC(k) < cok log k. (4) 


This theorem is strongly connected to the following extremal graph 
theoretical question: 


Assume that G,, is a graph in which the minimum degree is D. Find an 
upper bound on the girth of the graph. 


Here the usual upper bound is 


Paul Erdos’ Influence on Extremal Graph Theory 277 


2 log n 


* log(D — 1) (5) 


The proof is easy: Assume that the girth is g and let k = [5+]. Take a 
vertex x and denote the set of vertices having distance t from « by X;. Then 
for t < k we have |X;| > (D — 1)|X;_1|. Therefore 


n=v(Gn) >1+D+D(D-1)+D(D-1)?+---+ D(D—- 1). 


This implies (5). 

These things are connected to many other parts of Graph Theory, in some 
sense even to the Robertson—Seymour theory. Below I shall try to convince 
the reader that the Gallai problem is strongly connected to the girth problem. 

In [153] I gave a short proof of the upper bound of (4). My proof goes as 
follows (sketch!): 

Let G,, be an arbitrary graph not containing k + 1 independent circuits. 
Let H,, be a maximal subgraph of G,, all whose degrees are 2, 3 or 4. Then 
one can immediately see that the ramification vertices of Hy, i.e. the vertices 
of degree 3 or 4 represent all the cycles of G,,.'° Let yz: be the number of these 
vertices. Replacing the hanging chains’® by single edges, we get an H,, each 
degree of which is 3 or 4. So one can easily find a cycle C of length < c3 log pu 
in H,. Applying this to H, — C™ (but first cleaning up the resulting low 
degrees) we get another short cycle C®@). This cleaning up is where we have 
to use that the degrees are bounded from above. Iterating this (and using 
that the degrees are bounded from above) one can find c4y/ log 4 vertex— 
independent cycles in G,. Since k < cap/log py, therefore w<csklogk. y 


The Erdés—Pésa theorem is strongly connected with the girth problem. 
If, e.g. we had shorter circuits in graphs with degrees 3 and 4 then the above 
proof would give better upper bound on RC(k) — but that is ruled out. 


Remark [N] 17 (Directed graph version). It was Gallai [262] who first 
asked for extending the Erdés—Pésa theorem to digraphs. The problem is 
highly non-trivial even in its simplest case, when any pair of directed cycles 
have a vertex in common. Perhaps this is why in its published form Gallai 
asked only to prove this simplest case, for me it is obvious that he also meant 
the general case. The general case formally was formulated only 5 years later, 
by D. Younger [342]. 

McCuaig [300], proved that if in a digraph any two directed cycles have 
a common vertex, then one can delete 3 vertices to turn the graph into 
an acyclic one. Reed, Robertson, Seymour, and Thomas settled the general 
question [314], proving that for every integer k > 0 there exists an integer 


' The vertices of degree 4 are not really needed ... 
16 baths all whose inner vertices have degree 2 
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t > 0 such that for every digraph G, either G has k vertex-disjoint directed 
circuits, or G can be made acyclic by deleting at most ¢ vertices. 


The Margulis Graphs and the 
Lubotzky—Phillips-Sarnak Graphs 


Sometimes we insist on finding constructions in certain cases when the 
probabilistic methods work easily. Often finding explicit constructions is very 
difficult. A good example of this is the famous case of the Ramsey 2—colouring, 
where Erdés offered a larger sum (money) for finding a construction of a graph 
of n vertices not containing complete graphs or independent sets of at least 
c log n vertices, for some large constant c. (See Frankl and Wilson [100].) 

Another similar case is the girth problem discussed above, with one 
exception. Namely, in the girth problem Margulis [133] and in constructing 
Ramanujan graphs Margulis [134, 135, 136] and Lubotzky—Phillips-Sarnak 
[128, 129], succeeded in constructing regular graphs G,, of (arbitrary high) 
but fixed degree d and girth at least cg log n. (The original existence proof 
is due to Erdés and Sachs [82] and uses induction.) 

The graphs we consider here are Cayley graphs. Below (skipping many 
details) 


(a) First we explain, why should one try Cayley graphs of non-commutative 
groups, 

(b) Then we give a sketch of the description of the first, simpler Margulis 
graph [133]. 

(c) Finally we list the main features of the Lubotzky—Phillips-Sarnak graph 
[133]. 


(a) Often cyclic graphs are used in the constructions. Cyclic graphs are 
the graphs where a set A, C [1, n] is given, the vertices of our graphs 
are the residue classes Z; (mod n), and Z; is joined to Z; if |i—j| € An 
(or |Jn+%—j| € An). One such well known graph is Q, (the Paley 
graph) obtained by joining Z; to Z, if their difference is a quadratic 
non-residue. The advantage of such graphs is that they have great 
deal of fuzzy (randomlike) structure. From the point of view of the 
short cycles they are not the best: they have many short even cycles. 

(b) Given an arbitrary group G and some elements gi,..., 9: € G, these 
elements generate a Cayley graph on G: we join each a € G to 
the elements agi,...,ag:. This is a digraph. If we are interested in 
ordinary graphs, we choose gj,...,g: so that whenever g is one of 
them, then also g~! € {g1,...,9:}. Thus we get an undirected graph. 
Still, if G is commutative, then this Cayley graph will have many even 
cycles. For example, a, agi, 09192, 491929, |,491929, 92 is (mostly) 
a Cy for commutative groups and a P; for our non-commutative 
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groups. So, if we wanted to obtain Cayley graphs with large girth, we 
have better to start with non-Abelian groups. This is what Margulis 
did in [133]: 

Let X denote the set of all 2 x 2 matrices with integer entries and 
with determinant 1. Pick the following two matrices: 


12 10 
4=(12) aa a= (29). 


It is known that they are independent in the sense that there is no 
non-trivial multiplicative relation between them.'” So, if we take the 
4 matrices A, B, A~! and B~, they generate an infinite Cayley graph 
which is a 4-regular tree. If we take everything mod p, then it is easy 
to see that the tree collapses into a graph of n & p° vertices, in which 
the shortest cycle has length at least c log p for some constant c > 0. 
This yields a sequence of 4-regular graphs X,, with girth = c* log n 
for some c* = 0.91... Margulis also explains, how the above graphs 
can be used in constructing certain (explicit) error-correcting codes. 
Margulis has also generalized this construction (in the same paper) 
to arbitrary even degrees. 


Theorem 7.21 (Margulis [133]). For every « > 0 we have in- 
finitely many values of r, and for each of them an infinity of regular 
graphs X; of degree 2r with girth 


ways (S-<) a 


(c) The next breakthrough was due to Margulis [134] and to Lubotzky, 
Phillips and Sarnak [128]. The graph of Lubotzky, Phillips and Sarnak 
was obtained not for extremal graph purposes. They were interested 
in the extremal spectral gap of d-regular graphs: they constructed 
graphs where the difference between the first and second eigenvalues 
is as large as possible. Graphs with large spectral gaps are good 
expanders, and this was perhaps the primary interest in [128] or 
in [134]. As the authors of [128] remarked, Noga Alon turned their 
attention to the fact that their graphs can be “used” also for many 
other, “classical” purposes. 


Definition 7.22. Let X be a connected k-regular graph. Denote by \(X) 
the second largest eigenvalue (in absolute value) of the adjacency matrix 
of X. 


Definition 7.23. A k-regular graph on n vertices, X = Xy,x, will be called 
a Ramanujan graph, if Xn) < 2Vk —1. 


'” This is far from being trivial but can be proved, or found e.g. in [194]. 
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I do not have the place here to go into details, but the basic idea is 
that random graphs have roughly the spectral gap!® required above and vice 
versa: if the graph has a large spectral gap, then it may be regarded in some 
sense, as if it were a random graph. So the Ramanujan graphs provide near- 
extremum in some problems, where random graphs are near-extremal. (See 
also Alon [3], Chung, Graham and Wilson [37], Fiiredi and Komlés [260], and 
Alon—Boppana [214].) 

Let p,q be distinct primes congruent to 1 mod 4. The Ramanujan graph 
X?P-4 of [128] is a p+ 1-regular Cayley graph of PSL(2, Z,) if the Legendre 
symbol ) = 1 and of PGL(2,Z,) if 7 = —1. (Here Z, is the field of 
integers mod q.) 

Theorem 7.24 (Alon, quoted in [128]). Let X,, = X?4 be a non- 


bipartite Ramanujan graph; (") = 1,k =p+1,n= q(q? — 1)/2. Then 
the independence number 


2Vk—1 
a(Xpq) < ———n. 
k 
Corollary 7.25 ([128]). If X,,,, is a non-bipartite Ramanujan graph, then 
Xn,k) 2 =. 
kak) 2 EF 
Margulis, Lubotzky, Phillips and Sarnak have constructed Ramanujan 
graphs which are p + 1-regular, and 


(a) bipartite with n = q(q? — 1) vertices, satisfying 
2 log n 


girth(Xn p41) > = —O(1) and diam(Xn p41) < +3 


= 3 log p 


Further, they constructed non-bipartite Ramanujan graphs with n = q(q? — 
1)/2 vertices, and with the same diameter estimate and with 


pt+1 
Xx. > ——. 
’ x( np+1) —_ 2,/p 
Putting p = const or p % n° we get constructions of graphs the existence of 
which were known earlier only via random graph methods. As a matter of 
fact, they are better than the known “random constructions”, showing that 


2 
+0(1), a(Xn,p+1) < . 


irth(Xy, >= 
girth( Xn +1) ari 


4 
ex(n, Coz) > cgn** 36+35 


18 — difference between the largest and second largest eigenvalues 
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Further sources to read: 


(a) One may be interested in expander graphs and their constructions 
without really using this deep number theory. An extremely good start is 
to look for results of Avi Wigderson and his surrounding. Here I mention 
just a few papers: [215, 275, 316, 317]. 

(b) Another direction is to read about constructions connected to the 
constructions above, see the book of Lubotzky, or [194, 299], Sarnak 

245, 326]. 

(c) There is also a fascinating area to be mentioned here, on graphs and their 

eigenvalues, Cvetkovic-Doob-Sachs [186], F. Chung [184] Alon—Boppana 

214], Alon, Alon—Milman [218). 

(d) One fascinating area is the use of Expanders in Theoretical Computer 

Science [205, 215]. Perhaps the Ajtai-Komo6s-Szemerédi Sorting Network 

205] was the first example where a bounded degree expander graph was 

used to derandomize the algorithm. 


8. Further Degenerate Extremal Graph Problems 


We have already seen the most important degenerate extremal graph 
problems. Unfortunately we do not have as many results in this field as we 
would like to. Here we mention just a few of them. 


Topological Subgraphs 


Given a graph L, we may associate with it all its topologically equivalent 
forms. Slightly more generally, let T(L) be the set of graphs obtained by 
replacing some edges of L by “hanging chains”, i.e., paths, all inner vertices 
of which are of degree 2. 


Problem 8.1. Find the maximum number of edges a graph G,, can have 
without containing subgraphs from 7 (L). 


Denote the topological complete p-graphs by < K, >. G. Dirac [41] have 
proved that every G,, of 2n — 2 edges contains a < Ky, >. This is sharp: 
Dirac gave a graph G,, of 2n — 3 edges and not containing < Ky, >. Erdés 
and Hajnal pointed out that there exist graphs G,, of cp?n edges and not 
containing < K, >. (This can be seen, e.g. by taking [n/q] vertex-disjoint 
union K(q,q)’s for g = (P?) .) It was a breakthrough when Mader [130] (also 
see [131]) showed 


Theorem 8.2. For every integer p > 0 there exists a D = D(p) such that if 
the minimum degree of G is at least D(p), then G contains a < K, >. 


More precisely, 
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Theorem 8.3. There exists a constant c > 0 such that if e(G) > tn, then 
G contains a < K, > for p = [cv/log ¢]. 


Corollary 8.4. For every L, ex(n, 7 (L)) = O(n). 


Conjecture 8.5 (Erdés-Hajnal—Mader [71, 130]). If e(Gn) > tn, then Gy 
contains a < Ky > with p > cv‘. 


Mader’s result was improved by Komlds and Szemerédi to almost the 
best: 


Theorem 8.6 ([121]). There is a positive c, such that if e(G,,) > tn, then 
G,, contains a < K, > with 


t 
2 Oh a 
Tog 1) 
Very recently, improving some arguments of Alon and Seymour, Bollobas 
and Thomason completely settled Mader’s problem: 


Theorem 8.7 (Bollobds and Thomason [22]). If e(G,) > 256p?n, then 
G,, contains a < K, > 


This means that they have got rid of the log t-power in Theorem 8.6. Their 
proof-method was completely different from that of Komlés and Szemerédi. 
Komldés and Szemerédi slightly later also obtained a proof of Theorem 8.7 
(with some other constants) along their original lines [122]. 


Recursion Theorems 


Recursion theorems could be defined for ordinary graphs and hypergraphs, 
for ordinary degenerate extremal problems and non-degenerate extremal 
graph problems, for supersaturated graph problems,. .. However, here we shall 
restrict our considerations to ordinary degenerate extremal graph problems. 
In this case we have a bipartite L and a procedure assigning an L’ to L. 
Then we wish to deduce upper bounds on ex(n, L’), using upper bounds on 
ex(n, L). To illustrate this, we start with two trivial statements. 


Claim 8.8. Let L be a bipartite graph and L’ be a graph obtained from L 
by attaching a rooted tree T to L at one of its vertices.!? Then 


ex(n, L’) = ex(n, L) + O(n). 


Claim 8.9. Let L be a bipartite graph and L’ be a graph obtained by taking 
two vertex-disjoint copies of L. Then (again) 


'® This means that we take vertex-disjoint copies of L and T, a vertex x € V(T) 
and a vertex y € V(L) and identify x and y. 
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ex(n, L’) = ex(n, L) + O(n). 


The proofs are trivial. 

One of the problems Turan asked in connection with his graph theorem 
was to find the extremal numbers for the graphs of the regular (Platonic) 
polytopes. For the tetrahedron the answer is given by Turan Theorem 
(applied to Ky4). The question of the Octahedron graph is solved by 
Theorem 5.9, the problems of the Icosahedron and Dodecahedron can be 
found in Sect. 12, [154, 155]. On the cube-graph we have 


Theorem 8.10 (Cube Theorem, Erdés—Simonovits [86]). 
ex(n, Qa) = O(n*/®). 


We conjecture that the exponent 8/5 is sharp. Unfortunately we do not 
have any “reasonable” lower bound. 
The above theorem and many others follow from a recursion theorem: 


Theorem 8.11 (Recursion Theorem, [86]). Let L be a bipartite graph, 
coloured in BLUE and RED and K(t, t) be also coloured in BLUE and RED. 
Let L* be the graph obtained from these two (vertex-disjoint) graphs by 
joining each vertex of L to all the vertices of K(t,t) of the other colour. If 
ex(n, L) = O(n?-®) and 


then ex(n, L*) = O(n?-*). 


Applying this recursion theorem with ¢ = 1 and L = C6 we obtain the 
Cube-theorem.”° Another type of recursion theorem was proved by Faudree 
and me in [96]. 


Regular Subgraphs 


Let £L,_reg denote the family of r-regular graphs. Erdés and Sauer posed the 
following problem [64]: 


What is the maximum number of edges in a graph Gy, not containing any 
k-regular subgraph? 


Since (3,3) is 3-regular, one immediately sees that ex(n,£3 reg) = 
O(n°/*). Using the Cube Theorem one gets a better upper bound, ex(n,L3_ req) 
= O(n®/*). Erdés and Sauer conjectured that for every ¢ > 0 there exists an 


20 This approach proved the upper bound for most of the bypartite extremal 
results (at least, up to a contant) known those days. It did not cover Fiiredi’s 
Theorem 8.13. 
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no(k,€) such that for n > no(k,€) ex(n,Le—-reg) < n't©. Pyber proved the 
following stronger theorem. 


Theorem 8.12 (Pyber [144]). For every k, ex(n, Ly—reg) = 50k?n log n. 


The proof is based on a somewhat similar but much less general theorem of 
Alon, Friedland and Kalai [6]. For further information, see e.g. Noga Alon [2] 


One More Theorem 


We close this section with an old problem of Erdés solved not so long ago 
by Fiiredi. Let F'(k,t) be the bipartite graph with k vertices 21,...,2,% and 
(a further vertices in groups U;,; of size t, where all the vertices of UU;; are 
independent and the t vertices of Uj; are joined to x; and z; (l1<i<j<k). 
Erdés asked for the determination of ex(n, F'(k,t)) for t = 1. For t = 1 and 
k = 2 this is just Cy, so the extremal number is O(n3/?). Erdés also proved 
(and it follows from [86] as well) that ex(n, F(3,1)) = O(n3/2). 


Theorem 8.13 (Fiiredi [103]). ex(n, F(k,t)) = O(n°/?). 


Remark [N] 18. The above theorem was extendend by Alon, Krivelevich 
and Sudakov. They proved a conjecture of Erdés and myself according to 
which 


Theorem 8.14 ((217]). If Z is a bipartite graph with the colour classes A 
and B and all the vertices of B have degree at most r then 


ex(n, L) = O(n?-@/7), 


Further sources to read: Fiiredi, Simonovits: [188], Simonovits [159]. 


9. Supersaturated Graphs, Rademacher Type 
Theorems 


Almost immediately after Turdn’s result, Rademacher proved the following 
nice theorem (unpublished, see [50]) : 


Theorem 9.1 (Rademacher Theorem). If e(G,) > [| then G,, con- 
tains at least [4] triangles. 


This is sharp: adding an edge to (the smaller class of) T;, 2 we get [4] [3’s. 
Erdés generalized this result by proving the following two basic theorems [50]: 


Theorem 9.2. There exists a positive constant c; > 0 such that if e(G,) > 


[=7| , then G,, contains an edge e with at least cin triangles on it. 
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Theorem 9.3 (Generalized Rademacher Theorem). There exists a 
positive constant cp > 0 such that if 0 < k < con and e(G,) > | +k, then 


G, contains at least k [%] copies of K3. 


Lovasz and I proved the conjecture of Erdés that cz = 4 [126]. For further 
results see Moser and Moon [138], Bollobds [13, 14], and [126, 127]. 


Remark [N] 19. In our paper [127] we have formulated a general theorem 
for the possible maximum value of triangles, or, more generally, of Kp4+1’s ina 
supersaturated graph G,,. Our Stability approach went slightly further than 
what we published, however, was not enough to solve the general problem, 
not even for K3. The first beakthrough came from Fisher [255], Fisher-Ryan 
[256]. In the last two decades several important results were achieved, by 
Razborov (who created his “flag algebras [312] in extremal graph theory” to 
solve such problems). Recently — after several steps, — (e.g. Nikiforov [305]) 
Reiher [315] finally proved our conjecture. 


Erdos also proved the following theorem, going into the other direction. 


Theorem 9.4 (Erdés [61]). If e(G,) = [7 — and G,, contains at least 


one triangle, then it contains at least [>] — ¢— 1 triangles. 


(Of course, we may assume that 0 < ¢ < [4] — 3.) 


The General Supersaturated Case 


Working with Erd6és on multigraph and digraph extremal problems, Brown 
and I needed some generalizations of some theorems of Erdés [53, 59]. The 
results below are direct generalizations of some theorems of Erdés. To avoid 
proving the theorems in a setting narrower than what might be needed later, 
Brown and I formulated our results in the “most general, still reasonable”?! 
form. 


Definition 9.5 (Directed multi-hypergraphs [33]). A directed (r,q)- 
multi-hypergraph has a set V of vertices, a set H of directed hyperedges, i.e. 
ordered r—tuples, and a multiplicity function w(H) < q (the multiplicity of 
the ordered hyperedge) H € H. 


We shall return to the multigraph and digraph problems later, here I 
formulate only some simpler facts. The extremal graph problems directly gen- 
eralize to directed multi-hypergraphs with bounded hyper-edge-multiplicity: 


Given a family £ of excluded directed (r,q)—multi-hypergraphs, we may 
ask the maximum number of directed hyperedges (counted with multiplic- 


21 Of course, this notion does not exist. 
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ity) a directed (r,q)—multi-hypergraph Gn can have without containing 

forbidden sub-multi-hypergraphs from £. The maximum is again denoted 

by ex(n, L). 

Let L be a directed (r,q)—multi-hypergraph and L{t] be obtained from 
L by replacing each vertex v; of L by a set X; of t independent vertices, 
and forming a directed multihyperedge (y,,...,y,) of multiplicity p if y, € 
Xiz,--+;Yr € Xj, and the corresponding (v;,,..., V;,.) is a directed hyperedge 
of multiplicity w in L. 


Theorem 9.6 (Brown—Simonovits [33]). 
ex(n, L[t]) — ex(n, L) = o(n’). 


Again, the influence of Erdés is very direct: the above theorem is a direct 
generalization of his result in [59]. 


Theorem 9.7 (Brown—Simonovits [33]). Let £ be an arbitrary family of 
(r, g)-hypergraphs, and y = lim ex(mt) as n — 00.2” There exists a constant 


C2 = €2(L,¢) such that, if 
e(Gn) > (yte)n" 


and n is sufficiently large, then there exists some L € £ for which G,, contains 
at least con”) copies of this L. 


Remark [N] 20. This part is only a very short introduction into a very fast 
developing new area. Here we mention only a few results. 


1. One part of this new area is when the problem is non-degenerate and we 
are noticeably above the Turan threshold. Such a general result is the 
above Theorem 9.7. 

2. Lovasz and Simonovits, “still in the ancient times” , proved a conjecture of 
Erdés on the critical value of c in Theorem 9.3. On counting triangles, (or 
k,,’s) in supersaturated graphs, see e.g., Fisher [255], Fisher-Ryan [256], 
Razborov [313], Nikiforov [304], and Reiher [315]; 

3. The third type of results in this field is related to the bipartite excluded 
graphs, Erdés-Simonovits conjecture [159], which has several forms and 
asserts in weaker or stronger forms that among the graphs with e(G,) > 
C'-ex(n, L) the random graphs have the least copies of L. Sidorenko [328, 
329] reformulated these counting questions to inequlities on integrals and 
formulated the “Sidorenko’s conjecture” which is one of the central topics 
nowadays in the theory of graph limits. For more details, see Simonovits 
[159], Erdés-Simonovits [88], Fiiredi and Simonovits [188], Lovasz [193], 
Hatami [270]... 


2 The limit exists, basically because of [117]. 
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10. Typical K,-Free Graphs: 
The Erd6és-Kleitman-Rothschild Theory 


Erdés, Kleitman and Rothschild [75] started investigating the following 
problem: 


How many labelled graphs not containing DL exist on n vertices? 


Denote this number by M(n,L). We have a trivial lower bound on 
M(n, L): take any fixed extremal graph S,, and take all the 2°("") subgraphs 
of it: 


M(n, L) > 2%"), 


In some sense it is irrelevant if we count labelled or unlabelled graphs. The 
number of labelled graphs is at most n! times the number of unlabelled graphs 
and ex(n, L) > [| for all non-degenerate cases, (and ex(n, L) > en!*% for 
all the non-tree-non-forest cases). So, if we are satisfied with rough estimates, 
we may say: counting only labelled graphs is not a real restriction here. 

Strictly speaking, this problem is not an extremal graph problem, neither 
a supersaturated graph problem. However, the answer to the question shows 
that this problem is in surprisingly strong connection with the corresponding 
extremal graph problem. 


Theorem 10.1 (Erdés-Kleitman-Rothschild [75]). The number of K,- 
free graphs on n vertices and the number of p — 1-chromatic graphs on n 
vertices are in logarithm asymptotically equal: For every e(n) > 0 there exists 
an n(n) — 0 such that if M(n,K,,e) denotes the number of graphs of n 
vertices and with at most en” subgraphs K,, then 


ex(n, Ky) < log M(n, Kp,€) < ex(n, Ky) + nm’?. 


In other word, we get “almost all of them” by simply taking all the (p—1)— 
chromatic graphs. 


Remark [N] 21. Counting some structures may be interesting for many 
mathematicians, on its own, however there is an extra nice feature in 
Theorem 10.1, namely, that one gets some feeling for the typical D-free 
structures. 

Graphs were not the first structures to be considered this way. Here I 
mention only one result of Kleitman and Rothschild [281] which asserts that— 
in some sense-the typical posets are described by a random bipartite graph. 


Let us return to the typical structure of an L-free graph. Erdés conjec- 
tured that most of them are very similar to the subgraphs of an extremal 
graph (for L): 

Conjecture 10.2 (Erdés). If x(L) > 2, then 
M(n, L) = gex(n,L)+o(ex(n,L)) 
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This was proved by Erdés, Frankl and Rodl [68]. 

The corresponding question for bipartite graphs is unsolved. Even for the 
simplest non trivial case, i.e. for Cy the results are not satisfactory. This 
is not so surprising. All these problems are connected with random graphs, 
where for low edge-density the problems often become much more difficult. 
Kleitman and Winston [118] showed that 

Mig Cao, 
but the best value of the constant c is unknown. Erdés conjectured that 
M(n, L) — g(1+o(1))ex(n,L) 
Then the truth should be, of course 
M(n,C4) = 9(C1/2)+01)) nvr 


Further sources to read: Kolaitis, Promel and Rothschild extended the 
sharper estimates known for complete graphs to graphs L with critical 
edges [288]. Kleitman and David Wilson obtained results similar to the 
Kleitman—Winston Theorem [282]. Balogh and Samotij extended these results 
to general complete bipartite graphs [230], [231]. The original results of Erdés, 
Kleitman, and Rothschild, or, more generally, of Erdés, Frankl, and Rodl were 
sharpened (better error terms, typical structure, see e.g. Balogh, Bollobas, 
and Simonovits [223, 222, 224]) and extended to other structures, e.g., to 
hypergraphs, see e.g., Nagle and Rédl [302] and Person and Schacht [306]. 


I finish with a recent problem of Erdos. 


Problem 10.3 (Erdés). Determine or estimate the number of maximal 
triangle-free graphs on n vertices. 


Some explanation. In the Erdés—Kleitman—Rothschild case the number of 
bipartite graphs was large enough to give a logarithmically sharp estimate. 
Here K(a,n—a) are the maximal bipartite graphs, their number is negligible. 
This is why the situation becomes less transparent. 


Remark [N] 22. Since those early results many new results were proved in 
this field. 


(a) Sharper estimates were proved for M(n, L) [223]; actually, in some cases 
the typical structures were also described, [224] 

(b) For the first difficult case, to estimate M(n, LZ) Kleitman and Winston 
proved that 


Min, Ci) <2°"" (6) 
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11. Induced Subgraphs 


One could ask, why do we always speak of not necessarily induced subgraphs. 
What if we try to exclude induced copies of L? If we are careless, we 
immediately run into a complete nonsense. If L is not a complete graph 
and we ask: 


What is the maximum number of edges a Gn can have without having an 
induced copy of L? 


the answer is the trivial and the only extremal graph is K,. So let us 


give up this question for a short while and try to attack the corresponding 
counting problem which turned out in the previous section to be in a strong 
connection with the extremal problem. 


How many labelled graphs not containing induced copies of DL are on n 
vertices? 


Denote this number by M*(n,L). Prémel and Steger succeeded in 
describing M*(n, L). They started with the case of C4 and proved that almost 
all G, not containing an induced C4 have the following very specific structure. 
They are split graphs which means that they are obtained by taking a K,, 
and (n — m) further independent points and joining them to K,,, arbitrarily. 
(Trivially, these graphs contain no induced C4’s.) 


Theorem 11.1 (Prémel—Steger [141]). If S* is the family of split graphs, 
then 
M*(n, C4) 
|S7,| 
This implies, by a result of Prémel [140], that 


>las n7>ow. 


Corollary 11.2. There exist two constants, Ceyen > 0 and Coda > 0, such 
that 


M*(n, C4) Ceven for even n, 
gn? /4+n—(1/2)n log n 


Coda for odd n. 
Can one generalize this theorem to arbitrary excluded induced subgraphs? 


To answer this question, first Promel and Steger generalized the notion of 
chromatic number. 


Definition 11.3. Let 7(Z) be the largest integer k for which there exists 
an integer 7 € [0,4 — 1] such that no & — 1-chromatic graph in which j 
colour—classes are replaced by cliques contains DC as an induced subgraph. 


Clearly, if o(L) denotes the clique covering number, (= the minimum 
number of complete subgraphs of L to cover all the vertices of LZ) then 


Lemma 11.4 (Prémel—Steger). \(L),a(L) < 7(L) < x(L) + o(L). 
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Now, taking a T,,,, for p = T(L) — 1 and replacing j appropriate classes 
of it (in the above definition) by complete graphs and then deleting arbitrary 
edges of the T;,,, we get graphs not having induced L’s: 


M*(n, H) > 20- rv) (Z)+0(m"), 
This yields the lower bound in 


Theorem 11.5 (Prémel—-Steger). Let H be a fixed nonempty subgraph 
with T > 3. Then 


M*(n, H) = 20-ANGE)+otn"), 


Definition 11.6. Given a sample graph L, call G,, “good” if there exists 
a fired subgraph U, C G,, (= the complementary graph of G,,) such that 
whichever way we add some edges of Gy, to Un, the resulting U’ contains no 
induced copies of L. ex*(n, H) denotes the maximum number of edges such 


a G,, can have. 


Example 11.7. In case of Cy, any bipartite graph G(A, B) is “good”, since 
taking all the edges in A, no edges in B and some edges from G(A, B) we get 
a U,, not containing Cy as an induced subgraph. 


Theorem 11.8 (Prémel—Steger [143]). 


ex*(n, L) = (1 = —) (5) + o(n2). 


Thus Promel and Steger convincingly showed that there is a possibility 
to generalize ordinary extremal problems and the corresponding counting 
problems to induced subgraph problems. For further information, see [142, 
148, 233, 225]. 


Remark [N] 23. The whole theory described in the previous section which 
could be called Erdés—Kleitman—Rosthschild theory (or perhaps, Erdés— 
Frankl-Rédl theory) has analogs in case of the induced graphs, e.g. Theo- 
rem 11.5. Several interesting results were obtained lately, we mention just a 
few of them below. 

This subarea started with Alekseev [210] and Bollobds, and Thomason 
[235]?°. (See also [236, 237].) 

N. Alon, J. Balogh, B. Bollobas, and R. Morris [213] improve these results, 
by describing the typical structure when excluding an induced L. 

Balogh and Butterfield [226] characterized those graphs L, satisfying that 


For almost all L-freee graphs Gn, V(Gn) can be partitioned into a 
independent sets and b complete graphs, where a+b = r(L) (=the coloring 
number of L). 


23 This volume! 
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So, in some sense, they define the “weakly-edge-color-critical” graphs and 
extend the “ordinary” results to this case as well. 
Here I skip many further interesting results. 


12. The Number of Disjoint Complete Graphs 


There are many problems where instead of ensuring many K,+1’s (see Sect. 9) 
we would like to ensure many edge-disjoint or vertex-disjoint copies of Ky4+1. 
Let us start with the case of vertex-disjoint copies. 

If G, is a graph from which one can delete s — 1 vertices so that the 
resulting graph is p-chromatic, then G,, cannot contain s vertex-disjoint 
copies of K,41. This is sharp: let Hn ips := Tn—sti,p ® Ks—1. Then Hn py, has 
the most edges among the graphs from which one can delete s — 1 vertices to 
get a graph of chromatic number at most p. Further: 


Theorem 12.1 (Moon [137]). Among all the graph not containing s 
vertex-independent K,y+1’s Hn p,s has the most edges, assumed that n > 
No(p, 8). 


This theorem was first proved by Erdés and Gallai for p = 1, then for K3 
by Erdés [51], and then it was generalized for arbitrary p by J. W. Moon, 
and finally, a more general theorem was proved by me [155]. This more 
general theorem contained the answer to Turan’s two “Platonic” problem: 
it guaranteed that H,, 26 is the only extremal graph for the dodecahedron 
graph and H,,.33 for the icosahedron, if n is sufficiently large. For related 
more general results see Simonovits [198]. 

We get a slightly different result, if we look for edge-independent complete 
graphs. Clearly, if one puts k edges into the first class of T,,,,, then one gets k 
edge-independent K,,’s as long as k < cn. One would conjecture that this is 
sharp. As long as k is fixed, the general theorems of [155] provide the correct 
answer. If we wish to find the maximum number of edge-independent copies 


of Kp41 for 
1 n 
Gn) = (1-2) (3) +8 


for k = k(n) > ov, the problem changes in character, see e.g. recent papers 
of Gyéri [109, 111]. We mention just one theorem here: 


Theorem 12.2 (Gy6ri [110]). Let e(Gn) = e(Tn,») + k, (p > 3), where 
k< 3[54 | —5. Then G,, contains k edge-independent K,+1’s, assumed that 


n > no(p). 


For p = 2 (for triangles) the result is different in style, see [109]. 
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13. Extremal Graph Problems Connected 
to Pentagonlike Graphs 


A lemma of Erdés asserts that each graph G,, can be turned into a bipartite 
graph by deleting at most half of its edges. (Above: Theorem 3.7.) The proof 
of this triviality is as follows. Take a bipartite H, C G,, of maximum number 
of edges. By the maximality, each « € V(G,) having degree d(x) in Gp 
must have degree > $d(x) in H,. Summing the degrees in both graphs we 
get e(H,) > 4e(Gn). This estimate is sharp for random graphs of edge 
probability p > 0, in asymptotical sense. Now, our first question is if this 
estimate can be improved in cases when we know some extra information on 
the structure of the graph, say, excluding triangles in G,,. The next theorem 
asserts that this is not so. Let D(G,,) denote the minimum number of edges 
one has to delete from G,, to turn it into a bipartite graph. 


Theorem 13.1 (Erdés [56]). For every ¢ > 0 there exists a constant c = 
ce > 0 such that for infinitely many n, there exists a G,, for which K3 Z Gp, 
e(Gn) > cen”, and 


D(Gn) > (5 - :) (Gn). 


Conjecture 13.2 (Erdés). If K3; Z G,, then one can delete (at most) 
n? /25 edges so that the remaining graph is bipartite. 


Let us call a graph G,, pentagonlike if its vertex-set V can be partitioned 
into Vj,..., V5 so that # € V; and y € V; are joined iff ’—j = +1 mod 5. The 
pentagonlike graph Q,, := Cs[n/5] shows that, if true, this conjecture is sharp. 
The conjecture is still open, in spite of the fact that good approximations 
of its solutions were obtained by Erdés, Faudree, Pach and Spencer. This 
conjecture is proven for e(G,) > ie (see below) and the following (other) 
weakening is also known, [67]: 


Theorem 13.3. If kK3 Z G,, then 


for some (explicite) constant 5 > 0. 
In fact, Erdés, Faudree, Pach, and Spencer [67] proved that 
Theorem 13.4. For every triangle-free graph G with n vertices and m edges 


DG) < max { Fn ES a a} (7) 


2 n?(n? — 2m) n? 


Since the second term of (13.1) decreases in [3n?, $n?], and its value is 


exactly 3:n? for m = £n?, therefore (7) twice implies that if e(G,) > #n?, 


and K3 Z Gp, then D(Gn) < xn?. By Theorem 3.7, trivially, if e(Gn) < 
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en”, then D(Gn) < 3 
unsettled in the middle interval 2n® <e(G,)< % 

The next theorem of Erdés, Gyéri and myaell (70] states that if e(G,,) > 
en? , then the pentagon-like graphs need the most edges to be deleted to 
become bipartite. (This is sharper than the earlier results, since it provides 


also information on the near-extremal structure.) 


sn’. However, the general a is still open: it is 


Theorem 13.5. If K3 Z Gy, and e(G,,) > ss , then there is a pentagonlike 
graph H;* with at least the same number of alee: e(G,,) < e(H;), for which 
D(Gn) < D(z). 


Remark [N] 24. Not so long ago a similar question of Erdés was solved. If 
we consider C5[n/5], this does not contain K3 and contains ~ (2)? copies of 
C;. Erdés conjectured that 


Conjecture 13.6 (Erdés). If K3 Z G,,, then G,, has at most © (#)° copies 
of Cs. 


Andrzej Grzesik [267] and Hatami, Hladky, Kral, and Razborov [271] 
proved this, independently. 


14. Problems on the Booksize of a Graph 


We have already seen a theorem of Erdos, stating that if a graph has many 
edges, then it has an edge e with cn triangles on it. Such configurations are 
usually called books. The existence of such edges is one of the crucial tools 
Erdos used in many of his graph theorems. Still, it was a longstanding open 
problem, what is the proper value of this constant c above. Without going 
into details we just mention three results: 


Theorem 14.1 (Edwards [43, 44]). Ife(G,) > z]. then G,, has an edge 
with [n/6] +1 triangles containing this edge. 


This is sharp. The theorem would follow if we knew that there exists a 
K3 = (x,y,) for which the sum of the degrees, d(x) + d(y) + d(z) > %. 
Indeed, at least | vertices would be joined to the same pair, say, to ry. An 
other paper of Edwards contains results of this type, but only for e(G,,) > 


sn’. Let A, = A,.(G,,) denote the maximum of the sums of the degrees in a 


K, C Gy. (For instance, in a random graph R, A,(R,) =r: 2e(Rn) 
Theorem 14.2 (Edwards [43]). If A, > (1 - 4) n,n > 1, then 
1 5 2e(Gn ) 
A, ——.. 
r+1 hs n 


This theorem says that if G, has enough edges to ensure a K,41, then 
it also contains a K,+,, whose vertex-degree-sum is as large as it should be 
by averaging. Erdds, Faudree and Gyéri have improved Theorem 14.1 if we 
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replace the edge-density condition by the corresponding degree-condition. 
Among others, they have shown that 


Theorem 14.3 (Erdés—Faudree-Gyé6ri [66]). There exists a c > 0 such 
that if the minimum degree of G,, is at least [n/2] + 1, then G,, contains an 
edge with [n/6] + cn triangles containing this edge. 


15. Digraph/Multigraph Extremal Graph Problems 


We have already seen supersaturated extremal graph theorems on multi- 
digraphs. Here we are interested in simple asymptotically extremal sequences 
for digraph extremal problems. 

Multigraph or digraph extremal problems are closely related and in some 
sense the digraph problems are the slightly more general ones. So we shall 
restrict ourselves to digraph extremal problems. A digraph extremal problem 
means that some gq is given and we consider the class of digraphs where loops 
are excluded and any two vertices may be joined by at most qg arcs in one 
direction and by at most gq arcs of the opposite direction. This applies to 
both the excluded graphs and to the graphs on n vertices the edges of which 
should be maximized. So our problem is: 

Fix the multiplicity bound q described above. A family £ of digraphs is 

given and ex(n,£) denotes the maximum number of arcs a digraph Dn 


can have under the condition that it contains no L € £ and satisfies the 
multiplicity condition. Determine or estimate ex(n, L). 


The Digraph and Multigraph Extremal graph problems first occur in a 
paper of Brown and Harary [32]. They described fairly systematically all the 
cases of small forbidden multigraphs or digraphs. Next Erdé6s and Brown 
extended the investigation to the general case, finally I joined the “project”. 
Our papers [28, 29, 30] and [31] describes fairly well the situation q = 1 
for digraphs (which is roughly equivalent with g = 2 for multigraphs). We 
thought that our results can be extended to all gq but Sidorenko [149] and 
then Rédl and Sidorenko [146] ruined all our hopes. One of our main results 
was in a somewhat simplified form: 


Theorem 15.1. Let g = 1 and £ be a given family of excluded digraphs. 
Then there exists a matrix A = (a;;) of r rows and columns, depending 
only on £, such that there exists a sequence (S,,) of asymptotically extremal 
graphs for £ whose vertex-set V can be partitioned into Vi,..., V; so that for 
1<i<j<r,av€/V; isjoinedtoav’ € V; by an arc of this direction iff the 
corresponding matrix-element a;; = 2; further, the subdigraphs spanned by 
the V;’s are either independent sets or tournaments, depending on whether 
ay; = 0 or 1. 


One crucial tool in our research was a density notion for matrices. We 
associated with every matrix A a quadratic form and maximized it over the 
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standard simplex: 


g(A) = max {uAut : So ui =1, u3> 0.} 


The matrices are used to characterize some generalizations of graph sequences 
like (In,p)n>no of the general theory for ordinary graphs, and g(A) measures 
the edge-density of these structures: replaces (1 — +) of the Erdés-Stone— 


. . P 
Simonovits theorem. 


Definition 15.2. A matrix A is called dense if for every submatrix B’ of 
symmetric position g(B’) < g(B). In other words, B is minimal for g(B) = A. 


We conjectured that — as described below — the numbers g(B) are of finite 
multiplicity and well ordered if the matrices are dense: 


Conjecture 15.3. If q is fixed, then for each there are only finitely many 
dense matrices B with g(B) = X. Further, if (B,,) is a sequence of matrices of 
bounded integer entries then (g(B,,)) cannot be strictly monotone decreasing. 


One could wonder how one arrives at such conjectures, but we do not 
have the space to explain that here. Similar matrices (actually, multigraph 
extremal problems) occur when one attacks Turdn—Ramsey problems, see 
(72, 73, 74]. 

Our conjecture was disproved by Sidorenko and Rédl [146]. As a 
consequence, while we feel that the case q = 1 (i.e. the case of digraphs 
where any two points can be joined only by one arc of each direction) is 
sufficiently well described, for g > 1 the problem today seems to be fairly 
hopeless. Multidigraphs have also been considered by Katona in [116], where 
he was primarily interested in continuous versions of Turdn-type extremal 
problems. 


Remark [N] 25. (a) Many further details of this chapter can be found in 
Brown-Simonovits [240]. 

(b) Since the original version of this paper the theory of Graph Limits 
(see e.g. [193]) emerged and many phenomena observed there have 
some (much simpler) analogues in the theory of multigraph extremal 
problems. 

(c) Multigraph (or coloured multigraph) extremal problems — combined 
with stability methods — were used in solving the Hypergraph Extremal 
problem, where the Fano Plane was excluded, see de Caen-Fiiredi [241], 
Fiiredi and Simonovits [261] and Keevash and Sudakov [278]. 

An interesting feature of these solutions was that one did not have to 
assume the bounded multiplicity in the multigraph problem corresponding 
to the Fano hypergraph problem: it followed from the conditions. 

Also, in some sense very special multigraph extremal problems were used 
in the proof of the main Ramsey—Turan theorems of Erdés, Hajnal, Sd6s, and 
Szemerédi [72]. 


296 Mikl6és Simonovits 


16. Erdés and Nassredin 


Let me finish this paper with an anecdote. Nassredin, the hero of many 
middle-east jokes, stories (at least this is how we know it in Budapest), once 
met his friends who were eager to listen to his speech. “Do you know what I 
wish to speak about” Nassredin asked them. “No, we don’t” they answered. 
“Then why should I speak about it” said Nassredin and left.24 Next time 
the friends really wanted to listen to the clever and entertaining Nassredin. 
So, when Nassredin asked the audience “Do you know what I want to speak 
about”, they answered: “YES, we do” . “Then why should I speak about it” 
said Nassredin again and went home. The third time the audience decided to 
be more clever. When Nassredin asked them “Do you know what I will speak 
about”, half of the people said “YES” the other half said “NO”. Nassredin 
probably was lasy to speak: “Those who know what I wanted to tell you 
should tell it to the others” he said and left again.?° 

I am in some sense in Nassredin’s shoes. How could I explain on 30 or 50 
pages the influence of Erdés on Extremal Graph Theory to people who do 
not know it. And why should I explain to those who know it. Yet I think, 
Nassredin did not behave in the most appropriate way. So I tried — as I 
promised — to illustrate on some examples this enourmous influence of Paul. 
I do not think it covered half the topics and I have not tried to be too 
systematic. 

Long Live Paul Erdés!?° 
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Summary There are two central themes to research involving applications of 
probabilistic methods to partially ordered sets. The first of these can be described as 
the study of random partially ordered sets. Among the specific models which have 
been studied are: random labelled posets; random t-dimensional posets; and the 
transitive closure of random graphs. A second theme concentrates on the adaptation 
of random methods so as to be applicable to general partially ordered sets. In 
this paper, we concentrate on the second theme. Among the topics we discuss are 
fibers and co-fibers; the dimension of subposets of the subset lattice; the dimension 
of posets of bounded degree; and fractional dimension. This last topic leads to a 
discussion of Ramsey theoretic questions for probability spaces. 


1. Introduction 


Probabilistic methods have been used extensively throughout combinatorial 
mathematics, so it no great surprise to see that researchers have applied these 
techniques with great success to finite partially ordered sets. One central 
theme to this research is to define appropriate definitions of a random poset, 
and G. Brightwell’s excellent survey article [1] provides a summary of work 
in this direction. 

A second theme involves the application of random methods to more 
general classes of posets. After this brief introductory section, we present four 
examples of this theme. The first example is quite elementary and involves 
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fibers and co-fibers, concepts which generalize the notions of chains and 
antichains. The principal result here is an application of random methods 
to provide a non-trivial upper bound on the minimum size of fibers. 

Our second example is more substantial. It involves the dimension of 
subposets of the subset lattice, an instance in which many of the classic 
techniques and results pioneered by Paul Erdos play major roles. The third 
example involves an application of the Lovasz Local Lemma and leads 
naturally to the the investigation of the dimension of a random poset of 
height two. 

Our last example involves fractional dimension for posets—an area 
where there are many attractive open problems. This topic leads to natural 
questions involving Ramsey theory for probability spaces. 

The remainder of this section is a very brief condensation of key ideas and 
notation necessary for the remaining five sections. In this article, we consider 
a partially ordered set (or poset) P = (X, P) as a discrete structure consisting 
of a set X and a reflexive, antisymmetric and transitive binary relation P on 
X. We call X the ground set of the poset P, and we refer to P as a partial 
order on X. The notations « < y in P, y > x in P and (a,y) € P are used 
interchangeably, and the reference to the partial order P is often dropped 
when its definition is fixed throughout the discussion. We write x < y in P 
andy >ain Pwhenx <yin Panda y. When z,y € X, (x,y) ¢ P and 
(y,x) € P, we say x and y are incomparable and write zx || y in P. 

Although we are concerned almost exclusively with finite posets, i.e., 
those posets with finite ground sets, we find it convenient to use the familiar 
notation R, Q, Z and N to denote respectively the reals, rationals, integers and 
positive integers equipped with the usual orders. Note that these four infinite 
posets are total orders; in each case, any two distinct points are comparable. 
Total orders are also called linear orders, or chains. We use n to denote an 
n-element chain with the points labelled as0 << 1<---<n-—1. 

A subset A C X is called an antichain if no two distinct points in A are 
comparable. We also use P + Q to denote the disjoint sum of P and Q. 

In the remainder of this article, we will assume that the reader is familiar 
with the basic concepts for partially ordered sets, including maximal and 
minimal elements, chains and antichains, sums and cartesian products, com- 
parability graphs and Hasse diagrams. For additional background information 
on posets, the reader is referred to the author’s monograph [23], the survey 
article [14] on dimension by Kelly and Trotter and the author’s survey articles 
[21,22,25] and [26]. Another good source of background information on posets 
is Brightwell’s general survey article [2]. 


2. Fibers and Co-fibers 


The classic theorem of Dilworth [4] asserts that a poset P = (X, P) of width n 
can be partitioned into n chains. Also, a poset of height h can be partitioned 
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into Ah antichains. For graph theorists, these results can be translated into 
the simple statement that comparability graphs are perfect. Against this 
backdrop, researchers have devoted considerable energy to generalizations of 
the concepts of chains and antichains. Here is one such example. 

Let P = (X, P) be a poset. Lone and Rival [18] called a subset A C X 
a co-fiber if it intersects every non-trivial maximal chain in P. Let cof(P) 
denote the least m so that P has a co-fiber of cardinality m. Then let cof(n) 
denote the maximum value of cof(P) taken over all n-element posets. In any 
poset, the set A; consisting of all maximal elements which are not minimal 
elements and the set A» of all minimal elements which are not maximal are 
both co-fibers. As Ay 9 Ag = 9, it follows that cof(n) < |n/2|. On the other 
hand, the fact that cof(n) > |n/2| is evidenced by a height 2 poset with |n/2| 
minimal elements each of which is less than all [n/2] maximal elements. So 
cof(n) = |n/2]| (this argument appears in [18]). 

Dually, a subset B C X is called a fiber if it intersects every non-trivial 
maximal antichain. Let fib(P) denote the least m so that P has a fiber of 
cardinality m. Then let fib(n) denote the maximum value of fib(P) taken 
over all n-element posets. Trivially, fib(n) > |n/2], and Lone and Rival asked 
whether equality holds. 

In [6], Duffus, Sands, Sauer and Woodrow showed that if P = (X, P) 
is an n-element poset, then there exists a set FC X which intersects every 
2-element maximal antichain so that |F'| < |[n/2|. However, B. Sands then 
constructed a 17-point poset in which the smallest fiber contains 9 points. 
This construction was generalized by R. Maltby [19] who proved that for 
every € > 0, there exist a no so that for all n > no there exists an n-element 
poset in which the smallest fiber has at least (8/15 — €)n points. 

From above, there is no elementary way to see that there exists a constant 
a> 0 so that fib(n) < (1—a)n. However, this is an instance where random 
methods provided real insights into the truth. In the remainder of this paper, 
we use the notation [n] to denote the n-element set {1,2,...,n}. (No order 
is implied on [n], except for the natural order on positive integers.) 


Theorem 1. Let P = (X,P) be a poset with |X| =n. Then X contains a 
fiber of cardinality at most 4n/5. Consequently, fib(n) < 4n/5. 


Proof. Let C C X be a maximum chain. Then X — C is a fiber. So we may 
assume that |C| < n/5. Label the points of C as x, < m2 <--- < 2, where 
t = |C| < n/5. Next we define two different partitions of X — C. First, for 
each i € [t], set U; = {2 € X — C: 7 is the least integer for which z || 2; }. 
Then set D; = {% € X — C: 7 is the largest integer for which x || x; }. 

Then for each subset S C [t — 1], define 


B(S) = CU (U{D; +i € S})U (U{Uiar 22 ¢ SP) 


Note that for each i € [t — 1], the maximality of C implies that 
dD; M Uj41 = 0. 
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Claim 1. For every subset S C [t—1], B(S) is a fiber. 


Proof. Let S C [¢—1] and let A be a non-trivial maximal antichain. We show 
that AN B(S) 4 0. This intersection is nonempty if AN C 4 0, so we may 
assume that ANC = @. Now the fact that C is a maximal chain implies that 
every point of C is comparable with one or more points of A. However, no 
point of C can be greater than one point of A and less than another point of 
A. Also, x; can only be less than points in A, and x; can only be greater than 
points in A. It follows that t > 2 and that there is an integer i € [t — 1] and 
points a, a’ € A for which 2; < a in P and 2j41 >a’ in P. Clearly, a’ € D; 
and a € Uj41. If i € S, then D; C B(S), and if i ¢ S, then Ui41 C B(S). In 
either case, we conclude that AN B(S) 4 0. 


Claim 2. The expected cardinality of B(S) with all subsets S' C [t—1] equally 
likely is t + 3(n — t)/4. 


Proof. Note that C C B(S), for all S. For each element x € X —C, let i and 
j be the unique integers for which « € D; and x € U;. Then j A i+ 1. It 
follows that the probability that x belongs to B(S) is exactly 3/4. 


To complete the proof of the theorem, we note that there is some S$ C 
[t — 1] for which the fiber B(S) has at most t + 3(n — t)/4 points. However, 
t <n/5 implies that t + 3(n — t)/4 < 4n/5. 


The preceding theorem remains an interesting (although admittedly 
elementary) illustration of applying random methods to general partially 
ordered sets. Characteristically, it shows that an n-point poset has a 
fiber containing at most 4n/5 points without actually producing the fiber. 
Furthermore, this is also an instance in which the constant provided by 
random methods can be improved by another approach. 

The following result is due to Duffus, Kierstead and Trotter [5]. 


Theorem 2 (Duffus, Kierstead and Trotter). Let P = (X, P) be a poset 
and let H be the hypergraph of non-trivial maximal antichains of P. Then the 
chromatic number of H is at most 3. 


Theorem 2 shows that fib(n) < 2n/3, since whenever X = By U By U Bs 
is a 3-coloring of the hypergraph H of non-trivial maximal antichains, then 
the union of any two of {B,, Bo, B3} is a fiber. Quite recently, Lone [17] has 
obtained the following interesting result providing a better upper bound for 
posets with small width. 


Theorem 3 (Lonc). Let P = (X,P) be a poset of width 3 and let |X| =n. 
P has a fiber of cardinality at most 11n/18. 


I am still tempted to assert that lim, 4. fib(n)/n = 2/3. 
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3. Dimension Theory 


When P = (X, P) is a poset, a linear order L on X is called a linear extension 
of P when x < y in LE for all z, y € X with « < y in P. A set FR of linear 
extensions of P is called a realizer of P when P = NR, ie., for all x, y in 
X,« < yin P if and only if « < y in ZL, for every L € R. The minimum 
cardinality of a realizer of P is called the dimension of P and is denoted 
dim(P). 

It is useful to have a simple test to determine whether a family of linear 
extensions of P is actually a realizer. The first such test is just a reformulation 
of the definition. Let inc(P) = inc(X, P) denote the set of all incomparable 
pairs in P. Then a family 7 of linear extensions of P is a realizer of P if and 
only if for every (a, y) € inc(X, P), there exist distinct linear extensions L, 
L'€R so that x > yin Landy>vz in L’. 

Here is a more useful test. Call a pair (x,y) € X x X a critical pair if: 


1. x || y in P; 
2. 2 <a in P implies z < y in P, for all z € X; and 
3. w > yin P implies w > z in P, for allw Ee X. 


The set of all critical pairs of P is denoted crit(P) or crit(X, P). Then 
it is easy to see that a family R of linear extensions of P is a realizer of P 
if and only if for every critical pair (x,y), there is some L € R with « > y 
in L. We say that a linear order L on X reverses (x,y) if « > y in L. So 
the dimension of a poset is just the minimum number of linear extensions 
required to reverse all critical pairs. 

For each n > 3, let S,, denote the height 2 poset with n minimal elements 
@1,@2,...,@n, m maximal elements 1, b2,...,bn and a; < bj, for i,j € [n] 
and 7 #7. The poset S,, is called the standard example of an n-dimensional 
poset. Note that dim(S,,) is at most n, since crit(S,) = {(ai,bi) : i € [n]} 
and n linear extensions suffice to reverse the n critical pairs in crit(S,,). On 
the other hand, dim(S,,) > n, since no linear extension can reverse more than 
one critical pair. 


4. The Dimension of Subposets of the Subset Lattice 


For integers k, r and n with 1 < k < r < n, let P(k,r;n) denote 
the poset consisting all k-element and all r-element subsets of {1,2,...,n} 
partially ordered by inclusion. For simplicity, we use dim(k,7r;n) to denote 
the dimension of P(k, 7; 7). 

Historically, most researchers have concentrated on the case k = 1. Ina 
classic 1950 paper in dimension theory, Dushnik [7] gave an exact formula 
for dim(1,r;n), when r > 2\/n. 
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Theorem 4 (Dushnik). Let n, r and j be positive integers with n > 4 and 
2/n-2<r<n-1. Ifj is the unique integer with 2< j <./n for which 


97 4 72 (a 2 

|" Fl cp<|* 29 A+G =|, 
J qj-1 

then dim(1,r;n) =n—it+1. 


No general formula for dim(1,r;7) is known when r is relatively small in 
comparison to n, although some surprisingly tight estimates have been found. 
Here is a very brief overview of this work, beginning with an elementary 
reformulation of the problem. When LF is a linear order on X, S C X and 
ze X—S,wesayx > Sin Ll when «> sin L, for every se S. 


Proposition 1. dim(1,r;n) is the least t so that there exist t linear orders 
Ly, L2,...L4 of |n] so that for every r-element subset S C [n| and every 
x € [n| — S, there is some i € [t] for which a > S in L,. 


Spencer [20] used this proposition to estimate dim(1, 2;n). First, he noted 
that by the Erdds-Szekeres theorem, if n > 22" and R is any set of ¢ linear 
orders on [n], then there exists a 3-element set {x,y,z} C [n] so that for all 
LER, either x<y<zinLorg>y>zin L. Thus dim(1,2;n) >t when 
n > 2?. On the other hand, ifn < 22" then there exists a family 72 of ¢ linear 
orders on [n] so that for every 3-element subset S C [n] and every x € S, 
there exists some L € R so that either x < S—{ax} in Lora > S— {x} in L. 
Then let S' be the family of 2t linear orders on X determined by adding to R 
the duals of the linear orders in R. Clearly, the 2¢ linear orders in S satisfy 
the requirements of Proposition 1 when r = 2, and we conclude: 


Theorem 5 (Spencer). For alln > 4, 
lg lg n < dim(1,2;n) < 2 1g lg n. 


Spencer [20] then proceeded to determine a more accurate upper bound 
for dim(1,2;) using a technique applicable to larger values of r. Let t be a 
positive integer, and let F be a family of subsets of [t]. Then let r be an integer 
with 1 <r < t. We say F is r-scrambling if |F| > r and for every sequence 
(Aj, Ao,...,A,) of r distinct sets from F and for every subset B C [r], there 
is an element a € [t] so that a € Ag if and only if 6 € B. We let M(r,t) 
denote the maximum size of a r-scrambling family of subsets of [t]. Spencer 
then applied the Erdés/Ko/Rado theorem to provide a precise answer for the 
size of M(2,t). 


Theorem 6 (Spencer). M(2,t) = (\s=2)), for allt > 4. 
2 
As a consequence, Spencer observed that 


Ig lg n < dim(1,2;n) < lg lg n+ (§ +0(1)) lg lg Ig n. 
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Almost 20 years later, Fiiredi, Hajnal, R6dl and Trotter [13] were able to 
show that the upper bound in this inequality is tight, i-e., 


dim(1,2;n) = lg lg n + (4 + o(1)) lg Ig Ig n. 


For larger values of r, Spencer used random methods to produce the 
following bound. 


Theorem 7 (Spencer). For every r > 2, there exists a constant c= cp > 1 
so that M(r,t) > ct’. 


Proof. Let p be a positive integer and consider the set of all sequences of 
length p whose elements are subsets of [¢]. There are 2?* such sequences. The 
number of such sequences which fail to be r-scrambling is easily seen to be 


at most 
(*) ar (ar — 1)tg-)t, 


r 


os 


So at least one of these sequences is a r-scrambling family of subsets of 
provided (?)2"(2" — 1)*2-"* < 2". Clearly this inequality holds for p > c 
where c = c, ~ e72” is a constant larger than 1. 


Here’s how the concept of scrambling families is used in provide upper 
bounds for dim(q, 1; 7). 


Theorem 8 (Spencer). [fp = M(r,t) and n = 2?, then dim(1,r;n) < t. 


Proof. Let F be an r-scrambling family of subsets of [t], say F = {Aj, Ag,..., 
Ap} where p = M(r,t). Then set n = 2? and let Qi,Qz2,...,Qn be the 
subsets of [p]. For each a € [t], define a linear order L, on the set [n] by 
the following rules. Let x and y be distinct integers from [n] and let u = 
min((Qz — Qy) U (Qy — Qz)). Set « > y in La if either 


l.a€ A, and u€ Qz — Qy, or 
2.a¢ A, and u€ Qy — Qz. 


It is not immediately clear why L, is a linear order on [n] for each a € [E], 
but it is easy to check that this is so. Now let S be an r-element subset of 
[n] and let x € [n] — S. We must show that x > S in L, for some a € [E]. 
For each y € S, let uy = min((Qz — Q,) U (Q, UQz)) and then consider the 
family {A,, : y € S}. Since F is a r-scrambling family of subsets of [t], there 
exists some a € [t] such that a € A, if and only if uy € Qz. It follows from 
the definition of L, that x > S in Ly. 


By paying just a bit of attention to constants, the preceding results of 
Spencer actually yield the following upper bound on dim(1,1r; 7). 


Theorem 9 (Spencer). For allr > 2, dim(1,7r;n) < (1+0(1)) ger2” lg Ig n. 
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Of course, this bound is only meaningful if r is relatively small in 
comparison to n, but in this range, it is surprisingly tight. The following 
lower bound is a quite recent result due to Kierstead. 

Theorem 10 (Kierstead). If 2 <r < lglg n—lg lg lg n, then 


(r+2—lg lg n+lg lglg n)? lg n 
32 lg(r +2—lg lg n+lg lg lg n) 


< dim(1,r; 7). 


We will return to the issue of estimating dim(1,7r;7) in the next section. 


5. The Dimension of Posets of Bounded Degree 


Given a poset P = (X, P) and a point x € X, define the degree of x in P, 
denoted degp(«), as the number of points in X which are comparable to z, 
This is just the degree of the vertex x in the associated comparability graph. 
Then define A(P) as the maximum degree of P. Finally, define Dim(k) as 
the maximum dimension of a poset P with A(P) < k. Rédl and Trotter were 
the first to prove that Dim(k) is well defined. Their argument showed that 
Dim(k) < 2k? +2. It is now possible to present a very short argument for this 
result by first developing the following idea due to Fiiredi and Kahn [12]. 

For a poset P = (X, P) and a point x € X, let U(x) = {ye X:y>xrin 
P} and let U[x] = U(x) U {x}. Dually, let D(a) = {ye X : y <a in P} and 
D[x] = D(x) U {a}. The following proposition admits an elementary proof. 
In fact, something more can be said, and we will comment on this in the next 
section. 


Proposition 2 (Fiiredi and Kahn). Let P = (X,P) be a poset and let L be 
any linear order on X. Then there exist a linear extension L' of P so that if 
(x,y) is a critical pair and x > Dy] in L, then x > y in L’, so that x > D{y| 
in L’. 

Theorem 11 (Rédl and Trotter). If P = (X,P) is a poset with A(P) <k, 
then dim(P) < 2k? + 2. 


Proof. Define a graph G = (X, E) as follows. The vertex set X is the ground 
set of P. The edge set E contains those two element subsets {x,y} for which 
Ula] NU[y] # 9. Clearly, the maximum degree of a vertex in G is at most 
k*. Therefore, the chromatic number of G is at most k? +1. Let t= k? +1 
and let X = X, U X2U...U X; be a partition of X into subsets which are 
independent in G. Then for each 7 € [é], let L; be any linear order on X 
with X; > X — X; in L;. Finally, define L,,; to be any linear order on X so 
that: 


1. X; > X — X; in Lisi; and 
2. The restriction of Dy4; to X; is the dual of the restriction of LD; to X;. 
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We claim that for every critical pair (#,y) € crit(P), if « € X;, then 
either x > D{y] in L; or « > Diy] in L44;. This claim follows easily from the 
observation that any two points of D[y] are adjacent in G so that |D[y] N 
Ae) <1 


Fiiredi and Kahn [12] made a dramatic improvement in the upper bound 
for in Dim(k) by applying the Lovasz Local Lemma [9]. We sketch their 
argument which begins with an application of random methods to provide 
an upper bound for dim(1,7; 7). In this sketch, we make no attempt to provide 
the best possible constants. 


Theorem 12 (Fiiredi and Kahn). Let r and n be integers with 1 <r <n. 
Ift is an integer such that 


n—1 rot 
1 
nts, @) 
then dim(1,r;n) <t. In particular, dim(1,r;n) < r(r + 1) log(n/r). 


Proof. Let t be an integer satisfying the inequality given in the statement of 
the theorem. Then let {Z; : i € [t]} be a sequence of t random linear orders 
on X. The expected number of pairs (x, S) where S is an r-element subset of 
[n], x € [n]—S and there is no i € [¢] for which z > S in L; is exactly what the 
left hand side of this inequality is calculating. It follows that this quantity is 
less than one, so the probability that there are no such pairs is positive. This 
shows that dim(1,r;n) < t. The estimate dim(1,r;n) < r(r + 1) log(n/r) 
follows easily. 


Theorem 13 (Fiiredi and Kahn). [f P = (X,P) is a poset for which 
A(P) < k, then dim(P) < 100k log*k, i.e., Dim(k) < 100k log? k. 


Proof. The inequality dim(P) < 100k log?k follows from the preceding 
theorem if k < 1,000, so we may assume that k > 1,000. Set m = [k/log k] 
and r = [9 log k]. Using the Lovdsz Local Lemma, we see that there exists 
a partition X = Y; UY2...UYm, with |D[a]N Y;| <r, for every « € X. Now 
fix 7 € [m], let g = rk +1 and let s = dim(1,r;q). We construct a family 
Ry = {Ly | :| € [e fi} as follows. 

Let G be the graph on X defined in the proof of Theorem 11. Then let 
G; be the subgraph induced by Y;. Now it is easy to see that any point of 
Y; is adjacent to at most rk other points in Y; in the graph G,. It follows 
that the chromatic number of G; is at most rk +1. Let Y; = Yi U...UYi,g 
be a partition into subsets each of which is independent in G;. Then let 
R= {M, : | € [f]} be a family of linear orders of [q] so that for every 
r-element subset S C [q] and every x € [q]—S, there is some 7 € [s] for which 
x > Sin M;. 

Then for each 7 € [s], define L;,; as any linear order for which: 


1. Y; > X — Y; in Li; and 
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2. Ifa< bin M;, then Vina < Yip in Ly5. 
Finally, for each j € [s], define L;,,4; as any linear order for which: 


1. Y; > X — Y; in Lin s+js 

2.Ifa<bin M;, then Yi,a < Yio in Lij and 

3. Ifa € [q], then the restriction of L;,,4; to Yj,q is the dual of the restriction 
of Lig to Yeas 


Next we claim that if (x,y) is a critical pair and x € Yj, then there is 
some 7 € [2s] so that x > D[y] in L(i,7). To see this observe that any two 
points in Dy] are adjacent in G so at most r points in D[y] belong to Yj, 
and all points of Diy] NY; belong to distinct subsets in the partition of Y; 
into independent subsets. Let x € Y;,;0. Then there exists some j € [s] so 
that jo > j in M; whenever j ¥ jo and Diy| NYi,; #0. It follows that either 
x > Diy] in L(t,7) or & > Diy] in L(t, s+ 7). 

Finally, we note that s = dim(1,r;q) < r(r+1) log(q¢/r), so that dim(P) < 
100k log? k as claimed. 


There are two fundamentally important problems which leap out from 
the preceding inequality limiting the dimension of posets of bounded degree, 
beginning with the obvious question: Is the inequality Dim(k) = O(k log? k) 
best possible? However, the details of the proof also suggest that the 
inequality could be improved if one could provide a better upper bound 
than dim(1,log k;k) = O(log’ k). Unfortunately, the second approach will 
not yield much as Kierstead [15] has recently provided the following lower 
bound. 


Theorem 14 (Kierstead). If lglg n—lg lg lgn<r< gis! then 


(r +2 —Ig lg n +lg lg Ig lg n) on <dim(1,r;n) < —$—. 
32lg(r + 2 —Ig lg n+ 1g Ig lg n) =e 


(2) 


As a consequence, it follows that dim(1,log k;k) = Q(log® k/ log log k). 
So the remaining challenge is to provide better lower bounds on Dim(k). 
Random methods seem to be our best hope. Here is a sketch of the technique 
used by Erdés, Kierstead and Trotter [8] to show that Dim(k) = Q(k log k). 

For a fixed positive integer n, consider a random poset P,, having n 
minimal elements a1, da2,...,@, and n maximal elements bj, bo,...,b,. The 
order relation is defined by setting a; < b; with probability p = p(n); also, 
events corresponding to distinct min-max pairs are independent. 

Erdés, Kierstead and Trotter then determine estimates for the expected 
value of the dimension of the resulting random poset. The arguments are far 
too complex to be conveniently summarized here, as they make non-trivial 
use of correlation inequalities. However, the following theorem summarizes 
the lower bounds obtained in [8]. 


Theorem 15 (Erdés, Kierstead and Trotter). 
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1. For every € > 0, there exists 6 > 0 so that if 


log <p< 


log n’ 
then 
dim(P) > dpn log pn, for almost all P. 


2. For every € > 0, there exist 6,c > 0 so that if 


< p< 1 meee 
log n 


then 
cn 


dim(P) > max{dn,n — \, for almost all P. 


p log n 
The following result is then an easy corollary. 

Corollary 1 (Erdés, Kierstead and Trotter). For every « > 0, there exists 

6 > 0 so that if 


mite < D < 


log n’ 
then 
dim(P) > dA(P) log n, for almost all P. 
Summarizing, we now know that 
Q(k log k) = D(k) = O(k log’ k). (3) 


It is the author’s opinion that the upper bound is more likely to be correct and 
that the proof of this assertion will come from investigating the dimension 
of a slightly different model of random height 2 posets. For integers n and k 
with k& large but much smaller than n, we consider a poset with n minimal 
points and n maximal points. However, the comparabilities come from taking 
k random matchings. 

The techniques used by Erdés, Kierstead and Trotter in [8] break down 
when p = o(log n/n). But this is just the point at which we can no longer 
guarantee that the maximum degree is O(pn). 


6. Fractional Dimension and Ramsey Theory 
for Probability Spaces 


In many instances, it is useful to consider a fractional version of an integer 
valued combinatorial parameter, as in many cases, the resulting LP relaxation 
sheds light on the original problem. In [3], Brightwell and Scheinerman 
proposed to investigate fractional dimension for posets. This concept has 
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already produced some interesting results, and many appealing questions 
have been raised. Here’s a brief sketch of some questions with immediate 
connections to random methods. 

Let P = (X, P) be a poset and let F = {M.,..., My} be a multiset of 
linear extensions of P. Brightwell and Scheinerman [3] call F a k-fold realizer 
of P if for each incomparable pair (a, y), there are at least k linear extensions 
in F which reverse the pair (x,y), ie., |{¢:1<i<t,a>yin M;}| >k, The 
fractional dimension of P, denoted by fdim(P), is then defined as the least 
real number gq > 1 for which there exists a k-fold realizer F = {Mj,..., Mz} 
of P so that k/t > 1/q (it is easily verified that the least upper bound of 
such real numbers gq is indeed attained and is a rational number). Using this 
terminology, the dimension of P is just the least ¢ for which there exists a 
1-fold realizer of P. It follows immediately that fdim(P) < dim(P), for every 
poset P. 

Note that the standard example of an n-dimensional poset also has 
fractional dimension n. Brightwell and Scheinerman [3] proved that if P 
is a poset and |D(x)| < k, for all  € X, then fdim(P) < k + 2. They 
conjectured that this inequality could be improved to fdim(P) < +1. This 
was proved by Felsner and Trotter [10], and the argument yielded a much 
stronger conclusion, a result with much the same flavor as Brooks’ theorem 
for graphs. 


Theorem 16 (Felsner and Trotter). Let k be a positive integer, and let P 
be any poset with |D(a)| < k, for alla € X. Then fdim(P) < k +1. 
Furthermore, if k > 2, then fdim(P) < k+1 unless one of the components of 
P is isomorphic to Sy41, the standard example of a poset of dimension k+1. 


We do not discuss the proof of this result here except to comment that 
it requires a strengthening of Proposition 2, and to note that it implies that 
the fractional dimension of the poset P(1,r;n) is r +1. Thus a poset can 
have large dimension and small fractional dimension. However, there is one 
elementary bound which limits dimension in terms of fractional dimension. 


Theorem 17. If P = (X, P) is a poset with |X| =n and fdim(P) = q, then 
dim(P) < (2+ 0(1))q log n. 


Proof. Let F be a multi-realizer of P so that Probr|x > y] > 1/gq, for every 
critical pair (x,y) € crit(P). Then take t to be any integer for which 


n(n —1)(1—1/q)* <1. 


Then let {L1,..., £,} be a sequence of length t in which the linear extensions 
in F are equally likely to be chosen. Then the expected number of critical 
pairs which are not reversed is less than one, so the probability that we have 
a realizer of cardinality t is positive. 


Felsner and Trotter [10] derive several other inequalities for fractional 
dimension, and these lead to some challenging problems as to the relative 
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tightness of inequalities similar to the one given in Theorem 16. However, 
the subject of fractional dimension has produced a number of challenging 
problems which are certain to require random methods in their solutions. 
Here is two such problems, one of which has recently been solved. 

A poset P = (X,P) is called an interval order if there exists a family 
{[ax, bz] : « € X} of non-empty closed intervals of R so that x < y in P if and 
only if b, < ay in R. Fishburn [11] showed that a poset is an interval order 
if and only if it does not contain 2 + 2 as a subposet. The interval order I, 
consisting of all intervals with integer endpoints from {1,2,...,n} is called 
the canonical interval order. 

Although posets of height 2 can have arbitrarily large dimension, this is 
not true for interval orders. For these posets, large height is a prerequisite for 
large dimension. 


Theorem 18 (Fiiredi, Hajnal, Rédl and Trotter). If P = (X,P) is an 
interval order of height n, then 


dim(P) < lg lg n + (1/2 + o(1)) Ig Ig lg n. (4) 
The inequality in the preceding theorem is best possible. 


Theorem 19 (Fiiredi, Hajnal, Rédl and Trotter). The dimension of the 
canonical interval order satisfies 


dim(I,,) = lg lg n + (1/2 + o(1)) lg Ig lg n. (5) 


Although interval orders may have large dimension, they have bounded 
fractional dimension. Brightwell and Scheinerman [3] proved that the 
dimension of any finite interval order is less than 4, and they conjectured 
that for every € > 0, there exists an interval order with dimension greater 
than 4 — «. We believe that this conjecture is correct, but confess that our 
intuition is not really tested. For example, no interval order is known to have 
fractional dimension greater than 3. 

Motivated by the preceding inequalities and the known bounds on 
the dimension and fractional dimension of interval orders and the posets 
P(1,r;n), Brightwell asked whether there exists a function f : Q > R 
so that if P = (X,P) is a poset with |X| = n and fdim(P) = q, then 
dim(P) < f(q)lg lg n. If such a function exists, then the family P(1,r;7n) 
shows that we would need to have f(g) = (2%). 

But we will show that there is no such function. The argument requires 
some additional notation and terminology. Fix integers n and k with 1 <k < 
n. We call an ordered pair (A, B) of k-element sets a (k,n)-shift pair if there 
exists a (k + 1)-element subset C = {t1 < ig < +++ < tei C {1,2,...,n} so 
that A = {t1,7%2,...,7,} and B = {i2,%3,...,i4%41. We then define the (k, n)- 
shift graph S(k,n) as the graph whose vertex set consists of all k-element 
subsets of {1,2,...,n} with a k-element set A adjacent to a k-element set 
B exactly when (A, B) is a (k,n)-shift pair. Note that the (1,n) shift graph 
S(1,n) is just a complete graph. It is customary to call a (2,n)-shift graph 
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just a shift graph; similarly, a (3, n)-shift graph is called a double shift graph. 
The formula for the chromatic number of the (2,n)-shift graph S(2,n) is a 
folklore result of graph theory: x(S(2,n)) = [lg n]. Several researchers in 
graph theory have told me that this result is due to Andras Hajnal, but 
Andras says that it is not. In any case, it is an easy exercise. 

The following construction exploits the properties of the shift graph to 
provide a negative answer for Brightwell’s question. 


Theorem 20. For every m > 3, there exists a poset P = (X,P) so that 


1, |X| =m"; 
2. dim(X, P) > lg m; and 
3. fdim(X, P) < 4. 


Proof. The poset P = (X, P) is constructed as follows. Set X = {x(i,j) : 
1 < i,j < m}, so that |X| = m?. The partial order P is defined by first 
defining (i, 71) < (i, 72) in P, for each i € [m] whenever 1 < ji < jo < 
m. Furthermore, for each i € [m], x(i1, 71) < x(i2,j2) in P if and only if 
(ig — 1) + (J2 — fa) > m. 

We now show that dim(X,P) > lg m. Note first that for each 7,7 with 
1l<i<j<m, x(i,j —7%) || c(j,m). Let dim(X,P) = t, and let R = 
{Lo, Le,..., Lu} be a realizer of P. For each i, 7 with 1 < i < 7 < m, choose 
an integer o({i,7}) = a © {1,2,...,t} so that x(i,7 — i) > x(j,m) in La. 
We claim that ¢ is a proper coloring of the (2,m) shift graph $(1,m) using t 
colors, which requires that dim(X, P) = t > y(S(2,m) = [lg m]. To see that 
@ is a proper coloring, let 1,7 and k be integers with 1 <i<j<k<™m, let 
o({t,7}) = a@ and let o({j,k}) = 8. Ifa = GB, then x(t, 7 —i) > x(j,m) in La 
and x(j,k — j) > a(k,m) in La. Also, x(j,m) > «(j,k — 7) in P. However, 
since (k —i) + (m—j +14) >, it follows that x(k,m) > x(i,j — 7) in P, so 
that 2(k,m) > x(t,7 — 7) in Ly. Thus, 


x(t,j —1) > a(j,m) > «(j,k — 39) > a(k,m) > x(t,7 — 7) in P (6) 


The inequalities in equation 6 cannot all be true. The contradiction shows 
that ¢ is a proper coloring of the shift graph S(2,m) as claimed. In turn, this 
shows that dim(X, P) > [lg m]. 

Finally, we show that fdim(X,P) < 4. For each element x € X, let pi 
and pz be the natural projection maps defined by p(x) = i and po(a) = j 
when « = x(t, 7). Next, we claim that for each subset A C [ml], there exists a 
linear extension L(A) of P so that x > y in L(A) if: 


1. x || y in P; 
2. pi(z) € A and pi(y) ¢ A. 


To show that such linear extensions exist, we use the alternating cycle test 
(see Chap. 2 of [23]). Let A C [mJ], and let S(A) = {(a,y) © Xx X:a || y 
in P,pi(a) € A and pi(y) ¢ A}. Now suppose that {(uxg,vg) : 1 < k < 
p} C S(A) is an alternating cycle of length p, i.e., uz || vy and ug < vgqi in 
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P, for all k € [p] (subscripts are interpreted cyclically). Let k € [m]. Then 
pi(ur) € A and pi(vg4i1) ¢ A. It follows that uz < vg;1 in P, for each 
k € [p]. It follows that pi(ug41 — pi(up) + po(ve+1) — po(up) > m. Also, we 
know that m > p1 (vp) —p1(ug) +p2(ve) — po(ug). Thus pi (vg41) +p2(ve41) > 
pi(ug) + p2(v,). Clearly, this last inequality cannot hold for all k € [p]. The 
contradiction shows that S(A) cannot contain any alternating cycles. Thus 
the desired linear extension L(A) exists. 

Finally, we note that if we take F = {£(A): AC [{}]} and set s = |F], 
then x > y in at least s/4 of the linear extensions in F, whenever « || y in P. 
To see this, observe that there are exactly 2°/4 subsets of [m] which contain 
pi(x) but do not contain p(y). This shows that fdim(X, P) < 4 as claimed. 
It also completes the proof of the theorem. 


Now we turn our attention to the double shift graph. If P = (X,P) isa 
poset, a subset D C X is called a down set, or an order ideal, if x < y in P 
and y € D always imply that « € P. The following result appears in [13] but 
may have been known to other researchers in the area. 


Theorem 21. Let n be a positive integer. Then the chromatic number of the 
double shift graph S(3,n) is the least t so that there are at least n down sets 
in the subset lattice 2°. 


The problem of counting the number of down sets in the subset lattice 2¢ 
is a classic problem and is traditionally called Dedekind’s problem. Although 
no closed form expression is known, relatively tight asymptotic formulas 
have been given. For our purposes, the estimate provided by Kleitman and 
Markovsky [16] suffices. Theorem 21, coupled with the estimates from [16] 
permit the following surprisingly accurate estimate on the chromatic number 
x(S(3, 2)) of the double shift graph [13]. 


Theorem 22 (Fiiredi, Hajnal, Rodl and Trotter). 


x(S(3,7)) = 1g lg n+ (1/2 + o(1)) lg Ig lg n. 


Now that we have introduced the double shift graph, the following 
elementary observation can be made [13]. 


Proposition 3. For each n > 3, dim(1,2;n) > x(S(3,n)), and dim(I,,) > 
x(S(3,n)). 


Although the original intent was to investigate questions involving the 
fractional dimension of posets, Trotter and Winkler [27] began to attack a 
Ramsey theoretic problem for probability spaces which seems to have broader 
implications. Fix an integer k > 1, and let n > k + 1. Now suppose that 2 
is a probability space containing an event FE, for every k-element subset S C 
{1,2,...,n}. We abuse terminology slightly and use the notation Prob(S) 
rather than Prob(£3). 
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Now let f(Q) denote the minimum value of Prob(AB), taken over all 
(k,n)-shift pairs (A, B). Note that we are evaluating the probability that A 
is true and B is false. Then let f(n,&) denote the maximum value of f(Q) 
and let f(k) denote the limit of f(n,k) as n tends to infinity. 

Even the case k = 1 is non-trivial, as it takes some work to show that 
f(1) = 1/4. However, there is a natural interpretation of this result. Given 
a sufficiently long sequence of events, it is inescapable that there are two 
events, A and B with A occurring before B in the sequence, so that 


Prob(AB) < : +€. 

The + term in this inequality represents coin flips. The € is present 
because, for finite n, we can always do slightly better than tossing a fair 
coin. 

For & = 2, Trotter and Winkler [27] show that f(2) = 1/3. Note that this 
is just the fractional chromatic number of the double shift graph. This result 
is also natural and comes from taking a random linear order L on {1,2,...,n} 
and then saying that a 2-element set {i,7} is true if i < j in L. Trotter and 
Winkler conjecture that f(3) = 3/8, f(4) = 2/5, and are able to prove that 
limg+oo f (kK) = 1/2. They originally conjectured that f(k) = k/(2k+ 2), but 
they have since been able to show that f(5) > at which is larger than 3. 

As an added bonus to this line of research, we are beginning to ask 
natural (and I suspect quite important) questions about patterns appearing 
in probability spaces. 
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III. Infinity 


Introduction 


Paul Erd6s was always interested in infinity. One of his earliest results is 
an infinite analogue of (the then very recent) Menger’s theorem (which was 
included in a classical book of his teacher Denes Kénig). Two out of his 
earliest three combinatorial papers are devoted to infinite graphs. According 
to his personal recollections, Erdés always had an interest in “large cardinals” 
and his earliest work on this subject are joint papers with A. Tarski from 
the end of 1930s. These interests evolved over the years into the Giant Triple 
Paper, with the Partition Calculus forming a field rightly called here Erdésian 
Set Theory. 

We wish to thank Andras Hajnal for a beautiful paper which perhaps 
best captures the special style and spirit of Erdés’ mathematics. We solicited 
another two survey papers as well. An extensive survey was written by Peter 
Cameron on the seemingly simple subject of the infinite random graph, which 
describes the surprising discovery of Rado and Erd6s-Rényi finding many new 
fascinating connections and applications. The paper by Peter Komjath deals 
with another (this time more geometrical) aspect of Erdésian set theory. In 
addition, the research articles by Shelah and Kriz complement the broad 
scope of today’s set theory research, while the paper of Aharoni looks at 
another pre-war Erd6s conjecture. It is only with pleasure that we can remark 
that the Erdés—Menger problem has ben solved: 

R. Aharoni, E. Berger, “Menger’s Theorem for infinite graphs”. Inven- 
tiones Mathematicae 176 (2009), 1-62. 

In 1995/1996, when the contents of these volumes was already crystallis- 
ing, we asked Paul Erdés to isolate a few problems, both recent and old, 
for each of the eight main parts of this book. To this part on infinity he 
contributed the following collection of problems and comments. 
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Erdés in his own words 


I have nearly 50 joint papers with Hajnal on set theory and many with 
Rado and Fodor and many triple papers and I only state a few samples of 
our results. The first was my result with Dushnik and Miller. 

Let m > No. Then 


m7 (m, No)3. 


I use the arrow notation invented by R. Rado—in human language: If one 
colors the pairs of a set of power m > No by two colors either in color 1 there 
is a complete graph of power m or in color 2 an infinite complete graph. 
Hajnal, Rado and I nearly completely settled m — (n,q)3 but the results are 
very technical and can be found in our joint triple paper and in our book. 
In one of our joint papers, Hajnal and I proved that a graph G of chromatic 
number X; contains all finite bipartite graphs and with Shelah and Hajnal we 
proved that it contains all sufficiently large odd cycles (Hajnal and Komjath 
have sharper results). 

Hajnal and I have quite a few results on property B. A family of sets 
{A,} has property B if there is a set C which meets each of the sets Ag 
and contains none of them. This definition is due to Miller. It is now more 
customary to call the family two chromatic. Here is a sample of our many 
results: Let {A,} be a family of &, countable sets with |A,gM Ag| < n. Then 
there is a set S which meets each A, in a set of size (k + 1)n+1. The result 
is best possible. If & > w then there is a set for which |SMA,| < No. We have 
to assume the generalized hypothesis of the continuum. 

Todoréevié proved with c = X; that one can color the edges of the 
complete graph on |.$| = X, with &, colors so that every S; C S, |Si| =i, 
contains all colors. 

We wrote two papers on solved and unsolved problems in set theory. Most 
of them have been superseded in many cases because undecidability raised 
its ugly head (according to many: its pretty head). Here is a problem where, 
as far as we know, no progress has been made. One can divide the triples of a 
set of power 22" into t classes so that every set of power X; contains a triple 
of both classes. On the other hand if we divide the triples of a set of power 
(22° )* into two classes there is always a set of size Ny all whose triples are 
in the same class. If S = gene can we divide the triples into two classes so 
that every subset of size 8; should contain a K(4) of both classes (or more 
generally a homogeneous subset of size No)? I offer 500 dollars for clearing 
up this problem. 

Erdés—Galvin-Hajnal problem. Let G have chromatic number XN ,. Can one 
color the edges by 2 (or Xo, or Xi) colors so that if we divide the vertices into 
No classes there always is a class which contains all the colors? Todoréevié 
proved this if G is the complete graph of 8, vertices—in the general case 


Hajnal and Komjath have some results. 
28 26 2 2k 
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So much P. Erdés. The paper with Hajnal and Shelah is: 

P. Erdés, A. Hajnal, S. Shelah, Topics in topology (Proc. Colloq. 
Keszthely, 1972), Colloq. Math. J. Bolyai Soc. 8, North Holland, 1974, 
243-245. 

A. Hajnal is of course one of the most frequent collaborators of P. Erdés. 
His more recent survey is: 

A. Hajnal, On the chromatic number of graphs and set systems. PIMS 
Distinguished Chair Lectures, University of Calgary, 2004. 

The Rado graph (in today’s terminology, the universal ultra homogeneous 
graph) is also the countable random graph considered by Erdés and Rényi. 
The recent development here was motivated by the connection of ultra 
homogeneous structures and Ramsey classes (see the preceding part on 
Ramsey theory) and on the other side by the connections to model theory and 
mathematical logic. This is described briefly in the update of Peter Cameron’s 
paper. 
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Summary. We discuss a few issues concerning Erd6s’ conjecture on the extension 
of Menger’s theorem to infinite graphs. A key role is given to a lemma to which 
the conjecture can probably be reduced. The paper is intended to be expository, so 
rather than claim completeness of proofs, we chose to prove the reduction only for 
graphs of size X81. We prove the lemma (and hence the N: case of the conjecture) in 
two special cases: graphs with countable out-degrees, and graphs with no unending 
paths. We also present new versions of the proofs of the (already known) cases 
of countable graphs and graphs with no infinite paths. A main tool used is a 
transformation converting the graph into a bipartite graph. 


1. Introduction 


1.1. The Problem 


Kénig’s classical book [7] contains a proof of an infinite version of Menger’s 
theorem: 


Theorem 1. Given any two disjoint sets, A and B, of vertices in any graph, 
the minimal cardinality of an A-B separating set of vertices is equal to the 
maximal number of vertex-disjoint A-B paths. 


The proof was by a very young mathematician, Paul Erdos. In fact, it is 
not very hard: Let F be a maximal set (with respect to inclusion) of vertex 
disjoint paths, and let S be the set of all vertices participating in paths from 
F. Clearly, then, S is A-B-separating. If F is infinite, then |S| = |F| and 
the theorem is proved. If F is finite then so is S, and then techniques from 
the finite case can be applied. But this proof shows that Theorem 1 is not 
really the right extension to the infinite case: the separating set is of the right 
cardinality, but still it is obviously too large. It is not clear when did Erdés 
form the “right” conjecture, which is: 


Conjecture 1. For any two verter sets A and B in a graph, there exists a 
set F of disjoint A-B paths and an A-B separating set of vertices S, such 
that S consists of the choice of precisely one vertex from each path in F. 


Definition 1. A pair (F,S') as in the conjecture is called orthogonal. 
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As is well known, one can restrict the considerations to digraphs. The 
undirected case follows by the customary device of replacing each edge by a 
pair of oppositely directed edges. 

The main developments so far have been a proof of the countable bipartite 
case [8] (where, of course, it is assumed that A and B are the two sides of 
the graph); the case of countable graphs containing no infinite paths [9]; the 
general bipartite case [2]; the case of general graphs containing no infinite 
paths [1], and the countable case [3]. 

In [1] it was realized that in the absence of infinite paths, the conjecture 
can be reduced to the bipartite case. The simple transformation leading to 
the reduction is one of the main themes of the present paper. We shall follow 
it a bit further, and see how it can be used in other cases. 

In fact, already in [9] it was realized that the main difference between 
the bipartite case and the general one stems from the possible existence of 
infinite paths. The reason is that in the construction of F, while trying to 
reach B from A you may end up with infinite paths instead of A-B paths. 
This obstacle was overcome in [3] in the countable case, but the proof there 
relies very heavily on countability. 


1.2. Warps, Waves and Hindrances 


A triple T = (G, A,B), where G = G(L) is a digraph, and A = A(T), B = 
B(T) are subsets of V(G), is called a web. A warp is a set of disjoint paths 
(the term is taken from weaving). If the initial points of the warp are all in 
A then it is called A-starting. 

The initial vertex of a path P (if such a vertex exists) is denoted by in(P), 
and its terminal vertex by ter(P). The vertex set of P is denoted by V(P) 
and its edge set by E(P). For a family P of paths, we write V[P], E[P], in[P], 
ter|P]| for the vertex set, edge set, initial points set, and terminal points set 
of P, respectively. A linkage is a warp £ such that in[£] = A and ter[L] C B. 
A web is called linkable if it contains a linkage. 

Given a graph G and a subset R of its vertices, we write G[R] for the 
subgraph of G induced by R. For any set T we write G— T = G[V(G) \ T]. 
By I —T we denote the web (G—T,A\T,B\T). Given a path P (possibly 
in a super-web of I) we write [— P for T.—V(P). For an A-starting warp W, 
we write '/W for the web (G — (V[W] \ ter[W]), (A \ in[W]) U ter[W], B). 
(So, it is the web obtained by moving the points of in[W] along the paths of 
W, to serve as new source points.) For a warp consisting of a single path P 
we shall write ['/P for T/{P}. 

If a warp W is A-starting and ter[W] is A-B separating then W is called a 
wave. The trivial wave is the set of all singleton paths of the form (a),a € A. 
A hindrance is a wave W such that in[W] 4 A. Obviously, a hindrance is 
a non-trivial wave. A web is called hindered if it contains a hindrance. It is 
called loose if it contains no non-trivial wave (and then, of course, it is also 
unhindered). 
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For a wave W we write [ZW for the web (G—(V[W]\ter[W]), ter[W], B) 
(the difference between this and ['/W is that points in A\in[W] are not taken 
here as source points). 

Let W be a wave, and let T be the set of vertices in V which are separated 
by ter[W] from B. We write |W] for the web (G[Z], A, ter[W]). 

If P,,Po,..., Py are paths and x;, 1 < i < k are vertices such that 
VE V(P;) MN V (Pi41) then P12, P22 a .0p_-1Py denotes the path (if indeed 
it is a path) obtained by going along P, until reaching x7,, then switching 
to P2, then switching at x2 to Ps, and so forth. If the sequence ends with 
a vertex, rather than a path, the path intended is the one ending at that 
vertex. So, for example, P, means the part of P up to and including x. The 
same goes for sequences starting with a vertex. If x = ter(P) = in(Q) then 
PQ denotes the path PxQ. 

A useful operation between waves is the following: let /, W be waves. We 
write U + W for the warp {PxQ : P €U,Q © W,x = ter(P) € V(Q) and 
V(aQ) N VU] = {x}}. It was proved in [3] that U + W is a wave. 

There are two natural orders defined between waves: write W < U if U is 
an extension of W, i.e., every path P € UY is a continuation of some path Q 
from W such that V(P) \ V(Q) UV[W] = 0. As usual, “<” means “< and 
not equal”. Note that if U is obtained from W by the deletion of some paths 
then W <U. 

Let W XU if ter[W] is separated from B by ter|U/]. It is straightforward 
to see that both relations are indeed partial orders, and that W <U implies 
W XU. In [3] the following was proved: 


Lemma 1. In any webT there exists a maximal wave in the order <. If W 
is such a maximal wave then [ZW is loose. 


We shall often use this lemma without explicit mention. 
Also the following is easy: 


Lemma 2. If WAU thenuUtW>uU. 


In [3] it was shown that Conjecture 1 is equivalent to the following 
conjecture: 


Conjecture 2. An unhindered web is linkable. 


The proof of the equivalence is not hard. One direction is even trivial: 
assuming Conjecture 1, if F is not a linkage then the warp {Ps: PE F,s€S 
and s € V(P)} is a hindrance. 

For the proof of the other direction, let W be a <-maximal wave in T’. By 
Lemma 1 the web [ZW is loose. Assuming Conjecture 2 there exists then a 
linkage £ in TZW. Taking the concatenation of W and £ as F and ter[W] 
as S$ in Conjecture 1 yields then the conjecture. 

It is Conjecture 2 which we shall try to solve. As already mentioned, the 
countable case of the conjecture is known [3]. The key lemma there is: 
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Lemma 3. Jf T is an unhindered countable web then for every a € A there 
exists ana— B path P such that . — P is unhindered. 


(The countable case of Conjecture 2 follows immediately: if [ is unhin- 
dered, then by the lemma one can link the vertices a; of A into B by paths 
P; one by one, while keeping the web ! — U{V(P;) : 7 < i} unhindered.) 

In [4] a slight generalization of the countable case was proved: 


Lemma 4. Conjecture 2 is true for webs T in which there exists an A-B 
warp W such that in| A] \ in[W] is countable. 


Note that the condition in Lemma 4 is not hereditary, that is, it is 
not passed on to sub-webs. Hence the proof that Conjecture 2 implies 
Conjecture 1 does not show immediately that Conjecture 1 is true for webs 
as in the above lemma. But this was proved in [5]: 


Lemma 5. Conjecture 1 is true for webs satisfying the condition of Lemma 4. 
We believe that Lemma 3 is true in all webs: 
Conjecture 3. Lemma 8 is true in any web. 


The main purpose of this paper is to prove a reduction of Conjecture 1 
to Conjecture 3. But in order not go into too technical details we shall do it 
only for webs of size 1. 

We shall prove Conjecture 3 (for graphs of size 8;) in two special cases: 
webs with countable out-degrees, and webs with no unending paths. We shall 
therefore be able to prove Conjectures 1 and 2 for these two classes. In both 
proofs we shall use extensively the bipartite conversion, both technically 
and for inspiration. While not absolutely necessary, this will prove to be 
economical. The countable outdegrees case includes, of course, the case of 
countable webs, and the proof presented here is therefore also a new proof 
for this case (although the main ideas are similar to those of the old one, that 
in [3]). The new proof is more natural when considered in the language of 
the bipartite conversion. 


1.3. The Bipartite Conversion 


With any web I we associate a bipartite web b([) = (b(G), b(A), b(B)) in the 
following way. Every vertex v in V \ A is assigned a vertex v” in V(b(G)), 
and every vertex vu € V \ B is assigned a vertex v’. (So, vertices in V \ (AUB) 
are assigned two vertices each.) The edge set £(b(G)) is defined as {(2’, y”) : 
(x,y) € E(T)}U{(a', 2”): 2 €V\(AUB)}. The “source set” of 6([), which 
will be denoted by 6(A), is just {v’: vue V \ B}. The “destination set”, b(B), 
is {u” :v EV \ A}. 

A linkage F in T corresponds in a natural way to a linkage b(F) = 
{(z',y”") : (ay) © E[FI$ U {(a', 2") : « € V[F]} in d(L). Conversely, to 
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a linkage J in b(T) (linkages in bipartite webs are just matchings, hence they 
are denoted by capital letters, rather than script letters), there corresponds 
a warp w(J) in T, defined by E[w(J)] = {(a,y) : (2’,y”) € J, « # y}. (In 
general, backwards transformations from b([) to T will be denoted by w.) 
We write w(J) for the set of those paths in w(J) which begin at A. 

The following lemma is very easy to verify: 


Lemma 6. Let J be a linkage in b([). Then in[w(J)| = in[w(J)| = A. If a 
path in w(J) has an endpoint then this point belongs to B. Hence, if w(J) 
contains no undending paths, it is a linkage. 


Let now W be a wave in T. Define b(W) = {(2’, y”) : (2, y) € E[W]}U 
{(2') : a € ter[W]} U{(a', x2”): a € VIW]}. It is not hard to see that b(W) is 
a wave in b([). Conversely, a wave I in (I) corresponds to a warp w(J) in 
T, defined by E[w(Z)] = {(2, y) : (2, y”) € E[J]}, and the singleton paths in 
w(Z) are the singletons (a), where a € A and (a’) € I. 


Lemma 7. ter[w(I)] is A-B-separating. Every path in w(L) either starts in 
A or is non-starting. Hence, if w(I) contains no non-starting paths, it is a 
wave in IT. In the latter case, if I is a hindrance, so is w(I). 


Proof. Let P = #122...%, be an A-B path inT. If P is not met by ter[w(J)] 
then (z/,) ¢ I. Since the edge (a4, x4) is covered by ter|J], it follows that x4 € 
ter[I]. Let k be the last index for which x € ter|I]. If a, ¢ ter[w(Z)] then 
there exists y such that (x,,y") € I. Then the edge (2/,, x{,,) necessarily 
meets ter[I] at x¥,,, contradicting the choice of k. This shows that ter[w(J)| 
is A-B-separating. 

Let now Q € w(Z). If Q is a singleton x, then by the definition of w(I) we 
have x € A, so Q starts at A. If not, choose a vertex v = v9 € V(Q) \ ter(Q). 
Then vj ¢ ter[I], and hence the edge (vg, vg) is covered at vj. Hence there 
exists an edge (uj, vi) € J, and then (v1, v0) € E(Q). Repeating the same 
argument for v1, we obtain an edge (v2, v1) € E(Q). This process is either 
infinite, in which case Q is non-starting, or terminates when some vz is in A 
(and then the edge (vj, v;,) just does not exist.) This proves the second part 
of the lemma. 

The fact that if J is a hindrance then so is w(J) is obvious. 


From the above it follows that if ! does not contain unending paths then 
it is linkable if and only if b(T) is, and if it contains no non-starting path then 
it is hindered if and only if b(T) is. Since Conjecture 2 is true for bipartite 
graphs [2], this proves: 


Theorem 2. Conjecture 2, and hence also Conjecture 1, are true for graphs 
with no infinite paths. 


This was proved in [1] in a different way. 
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2. Safely Linking One Point 


For ¢ € V \ A we write [ - x for the web w(b(T) — 2”). In the language 
of webs, rather than that of bipartite graphs, it is the web obtained upon 
adding x to A, and deleting all edges going into x. For a subset X of V \ A 
we denote by P+ X the web w(b(T) — {a” : x © X}). 

A key fact in the discussion to follow is: 


Lemma 8. /fT is unhindered, v€ V\ A andT | v ts hindered, then there 
exists in T a wave U such that v is separated by ter|U] from B (possibly 
v € ter[U]). 


Proof. Let H be a hindrance in I'F v. If v is hindered in H (i.e., v ¢ in[H]), 
then H is a hindrance also in T. If not, then let u be the terminal point of 
the path R in H which starts at v, and let h be a point in A hindered by 
H. Let H’ = H \ {R}. If there is no H’-alternating path in T starting at h 
and ending at u then the set of edges in E[H’] participating in H’-alternating 
paths starting at h forms a hindrance in T (see [3] for a detailed proof of 
this fact). Hence, we may assume that there exists an H’-alternating path Q 
starting at h and ending at u. Applying Q to H (that is, taking the warp 
whose edge set is E[HJ|AE(Q)) yields then the desired wave U/. 


Corollary 1. If T is loose and x is a verter in V \ A from which B is 
reachable then TF x is unhindered. 


Lemma 9. /fT is hindered then so is TF v. 


Proof. Let H be a hindrance in T. If v € V[H] then HU {(v)} is a hindrance 
in TF v. If uv lies on some path H € H then H \ {H}U {vH} is a hindrance 
inTF v. 


Lemma 10. [fT is unhindered and W is a wave inT then for every path 
PEW the web T/P is unhindered. 


Proof. Suppose that there exists a hindrance H in T/P. Let 7 = W \ {P}, 
p = ter(P), let Q be the path in H starting at p, if such exists, and let K = 
H\{Q} (K =H if p ¢ in[H]). Let L=K + 7. Assume first that p ¢ in[H]. 
We claim that then £ is a hindrance in I, contrary to the assumption that 
T is unhindered. To see this, it is enough to show that L£ is a wave, since 
obviously in[£] 4 A. But this, in turn, is obvious, since in this case £ is just 
H t W, which is a wave. 

Assume next that p € in[H]. Let H’ = H \ {Q}U PQ and N = H' t W. 
We claim that Nis a wave (and then, obviously, a hindrance.) To see this, 
take an A-B path R. Let t be the last point on R which lies on a path from 
V[W]. Then, since W is a wave, t € ter[W]. Hence t ¢ V(P — ter(P)). But 
N'’=H*+ JT is a wave in T — (P —ter(P)), and hence the path tR contains 
a vertex from ter[N’] and hence also from ter|\V]. 
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Lemma 11. Jf T is unhindered and a € A then there exists a non-trivial 
finite path P starting at a such that T/P is unhindered. 


Proof. Let W be a maximal wave in T. If W contains a non-trivial path P 
starting at a then, by Lemma 10, P is the desired path. If not, then, since T 
is unhindered, there exists a vertex v which is connected to a in TZW and 
from which B is reachable. We claim that P = av is the desired path. This 
will clearly follow if we show that ['' v is unhindered. If this is not the case, 
then by Lemma 8 there exists a wave U in T which separates v from B. Since 
W does not separate v from B, W AU. But, by Lemma 2, this contradicts 
the maximality of W. 


Let S be an A-B-separating set of vertices in I. We shall denote by 
A— part(S) the set of all vertices which are separated by S from B, and by 
B-— part(S) the set of all vertices which are separated by S from A (Note 
that S$ is contained in both, and so are possibly other vertices.) We write Ig 
for the web (G[A— part($)], A, S) and T° for the web (G[B — part(S)], S, B). 
If W is a wave we write Py for Me,[(W] and P” for T4e,[W). 


Lemma 12. If S is A-B-separating and I is hindered then at least one of 
Tg or I is hindered. 


Proof. Let H be a hindrance in T. Let ¥ be the set of all maximal initial 
parts of paths in H which are contained in Ig, and let Y be the set of all 
paths of the form sH, where s € S$, H € H and s is the last vertex on H 
belonging to S. 

Suppose that ¥Y is not a hindrance in I'%. Then there exists a vertex 
s € S which is not separated from A by ter[¥]. Then the warp VU {(t) :t€ 
S\ {s}\ in[Y]} is a hindrance in Ig (since a path from s to B in I which 
avoids S\ {s}Uter|[)] could be combined with a path in I's from A to s which 
avoids ter[4] to yield a contradiction to the fact that H is a hindrance.) 


Lemma 13. [fT has no unending paths, and if it is unhindered, then for 
anya€A there exists an a-B path P such that . — P is unhindered. 


Proof. By Lemma 11 there exists a non-trivial path P, = Q, starting at a 
such that [/Q, is unhindered. Apply the lemma again, to ['/Q1, to obtain 
a path Py = Q1Q2 in T such that ['/P, is unhindered. Since [ does not 
contain unending paths, this process must halt at a certain point, and this 
can happen only when Py reaches B, for some k. Then P; is the desired 
path. 


The following theorem was (essentially) proved in [3]. But we would like 
to present here a somewhat different proof. The advantage of the new proof is 
that it makes better use of the bipartite conversion. It differs from the proof 
in [3] in that it --s vertices, rather than deletes them. Since it is the f-ing 
operation, and not deleting, which is used in the discussion of the uncountable 
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case in the following sections, this may indicate that there is a better chance 
of extending the proof to the uncountable case. 


Theorem 3. Let I be an unhindered web, and let a € A be such that there 
are only countably many a—B paths. Then there exists an a—B path P such 
that T — P is unhindered. 


Proof. Enumerate the a — B paths as (P,, P2, ...). We shall choose 
inductively webs [;, waves W; in T;, vertices vj; warps U; and sub-warps 
U; of U;(i < w), as follows. Let [9 =T, Wo and Uo the trivial wave in T, and 
Up = a. 

Let W, be a maximal wave in Ip, and let Ay = [9p ZW. By Lemma 1 
A, is loose. Let U; = W and let Ul, be the set of those paths in 4, which do 
not start at a. Now let 7; be first such that P;, does not meet V{?/;], and let 
v1 be the first vertex on P;, which does not belong to V [41]. Let (ui,v1) be 
the edge of P;, preceding v;. Then wu; € ter[W,], and since Aj is loose, by 


Catalan a, = Ay bon teauwhindered: 

Let W2 be a maximal wave in Ty, and define Ag = I, ZW. Let U2 be 
the concatenation of W, and Wp . Clearly, then, U2 is a wave in T'2. Let Up 
be the set of those paths in U2 which do not start at vo or v,. Let i2 be first 
such that P;, does not meet V [U4], and let v2 be the first vertex on P;, which 
does not belong to V [l/s]. Let (uz, v2) be the edge on P;, which precedes ve. 


d 
Since U/2 is a wave in T2, ue € ter|U2]. Hence, by Lemma 8 [2 oe Ao t v9 is 
unhindered. 
Continuing in this way, we obtain after w steps a wave YU in the web 


Ty ef T+ {vo, v1,.--}, which is the concatenation of all warps 1/;. It is divided 
into two parts: Z, which consists of those paths in U which begin at some v,;, 
and U =U \ Z, which is the “limit” of all U/; (i.e its edge set is the union of 
the edge sets of all U/;,) and whose starting points are all in A \ {a}. 


Assertion 1. If some path in Z ends in B (in particular if some v; is in 
B,) then the theorem holds. 


Proof. Suppose that some path Qo € Z ends in B. Let ko be such that 
Qo € Wr and let vp, = in(Qo) (possibly Qo = (vx,)). Let ux, be as in 
the construction above. Then u,, = ter(Q1) for some path Qi € W,, and 
in(Q1) = vp, for some kp < ky. Let uz, be as in the construction above, and 
take a path Q2 € W,,. which terminates at u,, and starts at v,~, for some 
kz < ko. Continuing this way we get a sequence of triples (vp,, Uz;, Qi), where 
the k; are descending. Eventually, k; = 0 for some j. 

Let P = Q;Qj;-1-..-Q1Qo. We claim that P is the desired path in the 
theorem. Assume, to the contrary, that [ — P is hindered. 

Write I= TF {vo,v1,...,ve,}- Let Zo = Ihy,,, and for each i < 7 
let B; = (Si-1)u, and UW; = (=;_1)“. By Lemma 9 the web II — P is 
hindered. Since Qo is a path in the wave W,, in T'y,, by Lemma 10 Tx, /Qo 
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is unhindered. Hence also V, — P, which is equal to Ty, — Q, is unhindered. 
It thus follows that =, — P is hindered. 

Again, by Lemma 12 this implies that either =, —P is hindered or V2—P 
is hindered. By the same argument as above, the second of these possibilities 
would imply that T;, /Q, id hindered, in contradiction to Lemma 10. Hence 
=. — P is hindered. Continuing this way we obtain that =; — P, which is 
just the trivial web I’, (i.e., the web having A as both its source set and its 
target set, and no edges,) is hindered, which is obviously false. This proves 
the assertion. 


We shall show that U/ is a wave. Since a ¢ in|U/], this will mean that it is 
actually a hindrance, contradicting the assumption that [is unhindered. In 
order to show that U/ is a wave, consider an A-B path T. Since W, is a wave 
inl, T meets ter[W]. Let x be the last vertex on T which belongs to ter[Wj] 
for some i. If « ¢ ter[U] then there exists 7 > 7 such that x € in[W,]. But 
since W; is a wave in T;, the path xT contains a vertex in ter[W,], contrary 
to the choice of i. Thus x € ter{l/]. It remains to show the impossibility of 
x € ter[Z]. Suppose that « = ter(Q) for some Q € Z. Let in(Q) = um, 
and suppose that v,, lies on the path P;. Then the path PjumQxT is one 
of the paths P,. But, by the choice of x, the part xT of this path does not 
contain a vertex from any V[W,,], and hence when the turn of P, arrived, we 
would have chosen from it all vertices of eT’, one by one, as vp-s. But then 
ter(T’) = vp for some p, contradicting Assertion 1. 


One case in which the conditions of Theorem 3 hold is that in which the 
outdegrees of all vertices are countable, so for webs with this property the 
theorem is known for all vertices a. Also, clearly, Lemma 3 is an immediate 
corollary. As already mentioned, the countable case of Conjecture 1 follows 
from this lemma directly. So the theorem yields a new proof of the countable 
case of Conjecture 1. 


3. Ni-Hindrances 


The key to the solution of the uncountable bipartite case of Erdés’ conjecture 
was in the definition of higher order hindrances, i.e., «-hindrances for regular 
uncountable cardinalities «. The course of the proof there was proving first 
that a bipartite web is linkable if it does not contain a hindrance or a k- 
hindrance for some regular uncountable cardinal « [6], and then proving that 
the existence of a «-hindrance implies the existence of a hindrance (in the 
old sense [2}). 

The notion of higher order hindrances can be carried over to general 
webs. Since the aim of this paper is to present general techniques rather than 
complete proofs, we shall be satisfied with presenting the notion of an Nj- 
hindrance, which means that the proofs will apply only to webs of size Xj. 
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The reader may take on faith the fact that the notion and proofs can be 
extended to general cardinalities, and he is referred to [6,2] for details in the 
bipartite case. 

For an ordinal 7 < &; an 7-ladder is a sequence L = (Ry : a < 7), where 
each “rung” R,, of the ladder is of one of three possible types (the webs T% 
mentioned in them are to be defined below) 


(1) A vertex vg € V(T'%) \ A(T) 

(2) A hindrance R,, in ®. 

(3) An infinite path Ry, which is the concatenation of paths in hindrances 
Re, 8 < @ appearing in rungs of type (2), and has not appeared as yet 
as a rung of type (8) (ie it is not R, for any y < a,¥y of type (3).) 


The ordinal 77 is called the height of the ladder. (If special mention of the 
ladder L is necessary, we shall write 7(Z), a remark applying to all notation 
referring to ladders.) The set of a-s of type (i) is denoted by ®;. The webs 
I are defined inductively: 


If a € ®; then Pet! is defined as T% + ug. 

If a € Sy then T°*! =T°%ZRy. 

If a € 63 then [°*! is obtained from T° by deleting in(R,) from its ’A- 
set (for the motivation of this definition see below, after the definition of 
Tq.) 


For limit a, '® is defined as the “limit” of the webs T’, 6 < a, in an 
obvious way: its ‘A’ set is the set of all those vertices which belong to A(I?) 
for cofinally many §-s in a, and similarly for the definition of its vertex and 
edge sets. Its ’B’ set is just B. 

Alongside with the the webs [@ we define warps ¥, = Ya(L) for each 
a < 7. Loosely speaking, Y, is the concatenation of all hindrances Rg, 6 < a. 
More precisely, we let 


E(Ya] = { EIR] : 8 < 0,6 € G2} \ U{EIRe] : 8 < 4,8 € 43}. 


The warp Y, is not defined solely by its edge set, because it may contain 
also singleton paths. It will be defined, however, once the set of its initial 
points is defined. We let in[V.] be the set of all vertices in AU {vg : 8 < a} 
which are not hindered in any hindrance Rg, € 2,8 < a and are not 
in(Rg) for any 6 € 3,8 <a. Also write Y, = V(= Y(L)). 

The following is easily proved by induction: 


Lemma 14. ter[Y,] separates A from B. 


From the inductive definition of the webs T° it is easy to see that T° 
is essentially P¢"el, the difference being that the vertex set of the latter 
may be larger—there may be vertices separated by ter|V.| from A, which 
are therefore in the vertex set of "+l, but are deleted in the process of 
forming [°. But for our purposes we may regard these two webs as being the 
same. 


A Few Remarks on a Conjecture of Erdés on the Infinite Version... 345 


Let Tg = Dyer a]: 

For v € V\A let p(v) = pz(v) be the first ordinal p for which v € V(T,). 

Rungs of type (3) appear in ladders since the initial point of an infinite 
path in Y has lost hope of being linked to B by Y, while it is separated from 
B by B(T,) (Lemma 14). So, it is a “problematic” vertex, which has to be 
specially taken care of. This complication does not arise, of course, in the 
case of webs with no unending paths, and as we shall see in Theorem 6 also 
not in the other main case discussed in this paper, namely that of webs with 
countable outdegrees. 


Notation 1. Let F(L) =U{Rq: a € 81}. For each a € ® let HD, be the 
set of points in A(T) which are hindered by R.. We also write HDAjq for 
the set of points in A which are the initial points of paths in Vo, terminating 
at HDg,. 


For a limit ordinal a let NF be the set of all infinite paths in Vy starting 
at A. 
Also let NFy = in[N FQ]. 


Definition 2. If L is of height 81 and ®2 or ®3 are stationary in X1 then 
L is called an &1-hindrance. 


One of the two main ingredients of the proof of Conjecture 1 goes over 
to general webs: 


Theorem 4. [fT contains an &y-hindrance then it is hindered. 


Proof. In [2] the theorem was proved for bipartite webs. We shall use this 
case of the theorem, but not directly. One has to go a bit into the proof in 
order to make it applicable to general webs. 

Let LD = (Ra : a < &1) be an Ny-hindrance. Assume, first, that @2(L) is 
stationary. Then DL corresponds to an Ny-hindrance N = (Ty : a < &i) in 
b([). By the main theorem of [2], N gives rise to a hindrance H in b(T). We 
would like to show that H can be chosen so that w(#) is a hindrance in T, 
i.e., so that it does not contain non-starting paths. 

Let M = b(E[Y(L))), ie., the set of edges in b([) corresponding to the 
edges in paths in the warp Y. Then M = E/Y(N)J, and it is a matching in 
b([). We write F = F(L) and BF = F(N). Obviously BF = b(F) 

The basic concept in the proof in [2] is that of popularity. 

Given a set of vertices U in any graph, a set P of paths in the graph is 
called U-joined if every two members of P meet only within U. A subset U 
of B(b(L)) is called popular if there exists a U-joined set {Zg : B € WU} of 
paths, where each Zg is an M-alternating path from some vertex u” of U to 
some vertex vu’ € HDg(N) and W C &y is stationary. 

By the usual abuse of notation we shall say that a vertex u” € B(b(T)) 
is popular if {u”} is popular. 

We shall now follow a construction given in [2]. 
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Let Uo be the set of all unpopular vertices in BF (that is, the set of 
all unpopular vertices u’,a € ®,.) Let Ky be the set of all vertices x’ 
connected to them, U; the set of all unpopular vertices in M[K,], K the set 
of all unpopular vertices connected to vertices in U,, U2 the set of unpopular 
vertices in M[K9], and so forth. Let, eventually, U = U{U; :i < wh, kK = 
U{K;:i<wh,andA=0(1)-U-K. 


Assertion 2. All points in A(A) are connected in b(T) only to points from 
B(A). 


Proof. This is clear, since by definition K is the set of points connected in 
b([) to U. 


Let M’ = Mjgra) and FR = B(A)\UM (“FR” stands for “free”). 
Clearly, FR = BF \U. 

The points in FR were popular in I’. The main idea of the proof in [2] is 
that they still remain so in A, although A is formed from T by the removal 
of vertices. In a somewhat informal manner, this can be put in: 


Assertion 3. Every point p in FR is still popular in A, namely it has a 
p-joined set of M’-alternating paths to “stationarily many” hindered points 
belonging to A(A) (i.e., points in A(A)\U M".) 


A little more formal, and stronger, is the following: 


Assertion 4. Let p € FR and let {X, : a € WV} be a p-joined set of M- 
alternating paths, where WV is stationary and each path Xq goes from p to 
HD,. Then {8:K meets Xg} is non-stationary. 


In [2] Assertion 4 was derived from an even stronger one, which we shall 
also use here: 


Assertion 5. The entire set U is unpopular. 


To see how Assertion 4 follows from Assertion 5, assume that the set O of 
B-s for which K meets Xg (say, at a point zg) is stationary. For each 6 € O 
choose a point ug € U connected to zg. Then the set of paths {ugzgXg : B € 
©} would show that U is popular. 

Another immediate corollary of Assertion 5 is: 


Assertion 6. The set of a-s for which HDaN A(A) #0 is stationary. 


This would clearly follow also from Assertion 3 if we knew that FR # 0. 
But this is not guaranteed. 

In the bipartite case, Assertion 3 is sufficient for the proof of Theorem 4. 
In fact, all that is used is that every point p € FR has a p-joined set of 
size X; of M’-alternating paths to points in A(A) \U M’. Since |F'R| < Nj, 
these alternating paths can be used to match all points of B(A) one by one 
into A(A). One then uses Fodor’s Lemma and Assertion 6 to prove that this 
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matching is strictly into. By Assertion 2 this matching provides the desired 
hindrance in b(T). 

An observation which will be used below is that -alternating paths in 
T are in one to one correspondence with M-alternating paths in b([), with 
the natural correspondence: an M-alternating path Q corresponds to a )- 
alternating path w(Q) whose edge set is w(E(Q)). In fact, this can be used 
as a definition of a -alternating path. (A detail worth noting is that when 
a )-alternating path Q lingers for more than one edge on a path from J), 
its corresponding path alternates between edges of M and edges of the form 
(x’, x"), which are not in M.) 

Write S = w(FR) and Y’ = w(M’). Then Y’ consists of fragments of 
paths from Y. Let Z be the set of paths in Y’ which start at A. Each s € S 
has Xi many Y’-alternating paths meeting only at s. Listing the points of S$ 
in a sequence of length at most w1, we can link them one by one using their 
y’-alternating paths, and sticking to the rule that a path in Y is never used 
in alternating paths pertaining to more than one point from S. 

The trouble is that if we are not careful in the application of the 
alternating paths, the warp which results may contain non-starting paths. 
To overcome this difficulty we prove: 


Assertion 7. Let s € S and let {Xg : 8 € WV} be an s-joined set of Y’- 
alternating paths, each Xg leading to HDg, and WV being stationary. Then 
there exists a stationary subset UV’ of V such that for each B € V' the paths 
from Y' which Xg meets are all in Z. 


Proof. We shall prove something even stronger, namely that the set of those 
B-s for which Xg meets a path not in Z is non-stationary. This set consists of 
two (possibly overlapping) parts. The first, =, consists of G-s for which Xg 
meets a path Qg € Y which does not start at A. The second, &2, consists of 
B-s for which Xg meets a path, say Yg, at a point yg such that there exists 
a point kg € K preceding yg on Yg. 

For each 6 € = the initial point of Qg is a vertex v = Ra for some a < P. 
Hence, if =, were stationary, by Fodor’s Lemma XN; many Xg-s would share 
the same @,. This is impossible, since the paths X¢g are disjoint and each Qg 
is finite. 

Assume next that =» is stationary. For each 8 € Ep» choose a vertex 
uz € U connected to kj, and witnessing the fact that k’ € K. Consider then 
the paths Jg = ugkgYgy3Xg. The paths b(Jg) in (Lf) then show that U is 
popular, contradicting Assertion 5. 


Let (ss : 6 < v <j) be a listing of all element of S. Choose inductively 
for each s5 a Z-alternating path Ps from s5 to some HD, 5)(L) which meets 
only paths in Z which were not met by any P¢,¢ < 6, does not contain edges 
from E|Y]\E[Z], and such that a(d) 4 a(¢) for ¢ < 6. Such choice is possible, 
by Assertion 7. Let H be the warp resulting from the application of all P;-s to 
Y. Since the paths Ps; correspond (by the correspondence b) to M’-alternating 
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paths in A, ter[H] = w(B(A)) and in[H] C w(A(A)). By Assertion 2 this 
means that H = b(E/H]) is a wave in b([). By Fodor’s Lemma there exists at 
least one a € ®2 such that no path P; reaches HD,(N). (In fact, there is no 
need to invoke Fodor’s Lemma for this, just choose the paths Ps so that this 
condition is fulfilled.) Hence H is, in fact, a hindrance. Since the paths Ps 
only use paths from Z to alternate on, and distinct ones at that, the paths 
in H all start at A. Hence, by Lemma 7, H. is the hindrance desired in the 
theorem. 

The case that ®3 is stationary is similar, but for simplicity of presentation 
it is better to work directly in T. For each a € 3 let P, be the infinite 
path which constitutes the rung R,. A vertex w € F is called popular if 
for a stationary subset V of ®3 there exist Y-alternating paths Qy,w € W 
from w to Py. (We demand that such a path ends when it reaches some 
vertex in Py). The proof then goes along the same lines as in the case of ®2 
stationary. 


We conjecture that also the other main ingredient of the proof in the 
bipartite case goes over to general webs, namely: 


Conjecture 4. If |V(T)| < Ni and TI contains no hindrance and no &1- 
hindrance then T is linkable. 


We are able to prove the conjecture only in the cases in which Conjecture 3 
is known. So, we have: 


Theorem 5. Conjecture 2 (and hence also Conjecture 1) is true for webs of 
size X1 with no unending paths or with countable outdeqrees. 


Proof. By Lemma 5 the theorem holds for webs in which A or B are 
countable. Hence we may assume that |A| = |B| = i. Enumerate V \ A 
as (Vg : 8 < 1). Define inductively the rungs Ra of a ladder L as follows. 
Let [° = LT. Suppose that Rs have been defined for all 5 < a. Let T® be 
defined as in the definition of ladders made above. First priority in the choice 
of rungs is given to type (3). That is, if there exists an infinite path in VF, 
which is not equal to Rg for any 8 < a, then choose such a path to be Rg. 

Second priority is given to type (2). So, if there does not exist an infinite 
path as above, but there exists a hindrance in I, choose Ra to be such a 
hindrance. 

If no path or hindrance as above exist, choose Ry to be the first vg in 
V\V(TQ). 

We proceed in this definition until all vertices in V have been covered. 
This, obviously, happens for some 7 < XN; For each v € V \ A write a(v) for 
the first a for which v € V(T,). 

Assume first that (ZL) < &,. Then the warp W consisting of all paths 
in Y(L) starting at A satisfies the condition that A \ in[W] is countable. 
Moreover, since all points of V([) \ A appear in the ladder L, ter[V(L)] must 
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cover all but countably many vertices in B. It follows that I’ satisfies the 
condition of Lemma 4, and hence also Conjecture 2. 

Thus we may assume that 7(L) < &;. By the assumption that T is &4- 
unhindered, there exists a club set IT such that ITM (®2U 3) = 0. 

The idea is to use IT to link [. Informally, this is done as follows: we link 
one point from A to B by a path P, using Lemma 13 and Theorem 3. Let 7 
be the first element of II for which all vertices in P are contained in I’,,. We 
wish to take care of the points in A which are not linked by Y,, into B(T;,), 
There are only countably many of those, and we link them one by one to 
B. At each step we take a possibly larger 7; € II so that I, contains the 
new paths. We add to the set of problematic points (those which have to be 
taken care of) all points in A which are hindered in some Ra, 7; > a € Il, 
and also N Fy, if a is a limit ordinal. After w steps we have taken care of all 
problematic points in ,, where z is the limit of all 7;. Since 7 € II, the web 
I” is unhindered. Also, NF, = 0. Hence we can turn over a new leaf—try to 
link points in [7. By the club-ness of II we can join all the partial linkages 
obtained in this procedure to obtain a linkage of A. 

Formally all this is a bit more cumbersome: 

Clearly, NF, is countable for every a < N,. For otherwise the rungs of 
L are of type (3) for all a < 8 < X,, making L an X,-hindrance. Similarly, 
HD, is countable, or else we could choose the rungs Rg to be of type (2) 
from a@ onwards. 

List the points of A as ag,9 < &1. 

For a subset Z of V \ A let 7(Z) be the minimal ordinal 7 € II such that 
mw > p(z) for every z € Z. 

By Lemma 13 and Theorem 3 there exists an ag — B path Po such that 
A, =I — Py is unhindered. Let t{ = m(V(Pp)), Let Ki be the set of all 
vertices in A \ {ao} which are initial vertices of paths from V[V(L)] which 
are met by Po, and let 


D, = Ky UHDA,1(L) UNF, U {a1} 


(For the definition of HDA see Notation 2.) Clearly, D, is countable, so we 
can order it in an w-sequence d;. Let v, be the first vertex in this list. There 
exists then a v; — B path P, such that Ag = A, — P, is unhindered. Let 
n? = t(V[{Ppo, Pi}]). Let Ke be the set of vertices from A \ {vo, v1} (where 
Uo = ao) which are initial vertices of paths from Y which are met by Po, Pi, 
and let 


Dz = KyU HDA,2 U (NFy2 \ NFy2) U {a2} 


Intersperse D2 in the even numbered places of d,, to obtain a list dz. Let v2 
be the first element in dz which does not belong to {vo, vi}. Choose a v2 — B 
path P, such that [3 = P2 — P, is unhindered. We continue in this way, 
and obtain sequences (v;), (74), (P;) of length w. Let 71 = sup{mi :i < wh. 
The interspersing of the sets D;, in the sequence of points to be linked to 
B is done so as to guarantee that each point is reached at some stage (so, 
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for example, D3 is interspersed in every third place, say, in dz). Then, after 
w steps, when the dust has settled, the warp No consisting of {P; : i < w} 
together with all paths in Y,, which connect A to B(I,,) and which do not 
meet any P,, links A into B(T;,). 

Now repeat the procedure with [™ replacing [’. This is possible, since 
m, € II, and hence [' is unhindered. This produces a linkage Nz of B(T™) 
into B([™?) for some m < m2 € II. 

Continuing this way, we obtain an N,-sequence of ordinals 7m € I, and 
linkages N¢ of A([',,) into B(I;,,,). For limit ¢ we just define m¢ = sup{7 : 
6 < ¢}. Since II is closed, a¢ € II. 

At each step ¢, the first ag in the list of elements of A which have not 
been linked as yet to B is not hindered in I’,., since this web is not hindered. 
So, it is the starting point of a path Q € Y,,. This path is not infinite, since 
if it is then, since TN 3 = 0, Q € NF«q for some a < ze. But then, by 
our construction, @g has been linked already to B at a previous stage. Let 
ay = ter(Q). Then a, € A(I,,). We put it first in the first list of vertices to 
be linked, so that after the ¢-th stage ag is linked to B. This guarantees that 
after X; steps all vertices of A are linked to B. 


Remark 1. The true content of the theorem, and what is actually proved, 
is that Conjecture 4 can be reduced to Conjecture 3 for webs of size Xi. We 
chose the above presentation since it states absolute results, rather than a 
result about implication. In fact, both cases mentioned in the theorem have 
simpler proofs. For webs with no unending path one can construct a ladder 
as in the proof of the theorem, and then link its ‘slices’ (defined by the club 
set II), and these linkages join up to form a linkage of T, since no unending 
path has been generated. In the case of countable outdegrees one can make do 
with a simpler kind of &%,-hindrances, as follows: 

Call a hindrance degenerate if all of its paths are singletons. A point in A 
hindered by a degenerate hindrance is just one from which B is unreachable. 
An &,-hindrance is called degenerate if all of it rungs Ra, a € ®2, are 
degenerate. Note that in such an ®-hindrance Y =), and hence there are no 
rungs of type (3). Similar arguments to those above (only simpler) show the 
following. 


Theorem 6. I[f|V(T)| < Xi, the outdegrees of all vertices inT are countable, 
and TV does not contain a hindrance or a degenerate \1-hindrance, then it is 
linkable. 


Of course, this follows from the previous theorems, since not being 
hindered suffices. What it really means is that the above proof can be 
simplified in this case. A version which does add new information, however, 
is this: 


Theorem 7. A web of size Xi with countable outdegrees is linkable if and 
only if every countable subset of A is linkable, and it does not contain a 
degenerate X,-hindrance. 
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4. A Remark on Necessary Conditions for Linkability 


Conjecture 2 will yield (if true) a sufficient condition for linkability in webs. 
The proof of the bipartite case of the conjecture gave, in fact, more than just 
a sufficient condition—a necessary and sufficient condition is known in this 
case. 

Podewski and Steffens [8] defined (in somewhat different terms) a special 
type of waves. A wave W is called tight if there is no warp U such that 
in[U] = in|[W], ter|U] Cc ter[W] and ter[U] 4 ter[W]. An obstruction is 
a tight hindrance. In [8] it was proved that a countable bipartite graph is 
linkable if and only if it is unobstructed. N1-obstructions are defined by taking 
the rungs Ra, a € ®2 to be obstructions, rather than hindrances. Again, it 
was proved [6] that a bipartite graph of size 1 is linkable if and only if it 
does not contain an obstruction or an N-obstruction. 

It is not even completely clear what should be the formulation of the 
necessary and sufficient condition for linkability in webs. Transferring the 
notions of tight waves and obstructions verbatim does not work, as the 
following example shows: 

Let At = {af ca < wi A? =da et < wh B= {sa < w} and 
C = {ce : i < w}. Also let A = A! U A? and V(G) = AUBUC. Let 
E(G) = A? x CUCx BU{(a}, ci): i < wp U{(a?, bj) : i < w}. Then the web 
A = (G, A, B) does not contain an obstruction in the above sense, and yet it 
is not linkable. What does exist in A is a tight wave J with a proper subwave 
T', which means to say that a proper subset of ter[T] is A-B-separating (take 
T = {(a?, bi): i < w}U {(a}, c) : i < w}). It is possible that this is the 
right notion for an “obstruction” in the case of webs. It will be interesting 
(and possibly of value for the proof of the general conjecture) to show that 
the non-existence of obstructions in this sense is sufficient for linkability in 
countable webs. 

Podewski and Steffens [8] noted that in a bipartite web I there exists 
a maximal linkable subset of A(T’) (the set of initial points of a maximal 
tight wave is such a set). The web A above provides a counterexample to 
this statement in the case of countable webs: it is not hard to show that a 
subset N of A is linkable if and only if A \ N is infinite, and clearly there is 
no maximal subset N of A satisfying this condition. 


Author’s Note, 2013 Edition 


Since the publication of this paper, the Erdés-Menger conjecture was solved: 
Aharoni, Ron and Berger, Eli (2009). “Menger’s Theorem for infinite 
graphs” , Inventiones Mathematicae 176: 162. doi:10.1007/s00222-008-0157-3. 
As foreseen in the present paper, the key was the proof of Lemma 3 for 
general (namely, uncountable) graphs. But the way from the lemma to the 
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theorem proved to be more intricate for cardinalities of size larger than Xy 
than for the X; case (which is done in this paper). 


As in the present paper, another key role was played by the bipartite 


conversion, that provides good insight into Menger’s theorem in the infinite 
case, as it does in the finite case. 
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Summary. Erdés and Rényi showed the paradoxical result that there is a unique 
(and highly symmetric) countably infinite random graph. This graph, and its 
automorphism group, form the subject of the present survey. 


1. Introduction 


In 1963, Erdés and Rényi [27] showed: 


Theorem 1. There exists a graph R with the following property. If a 
countable graph is chosen at random, by selecting edges independently with 
probability $ from the set of 2-element subsets of the vertex set, then almost 
surely (i.e., with probability 1), the resulting graph is isomorphic to R. 


This theorem, on first acquaintance, seems to defy common sense—a 
random process whose outcome is predictable. Nevertheless, the argument 
which establishes it is quite short. (It is given below.) Indeed, it formed a 
tailpiece to the paper of Erdés and Rényi, which mainly concerned the much 
less predictable world of finite random graphs. (In their book Probabilistic 
Methods in Combinatorics, Erdés and Spencer [28] remark that this result 
“demolishes the theory of infinite random graphs.” ) 

I will give the proof in detail, since it underlies much that follows. The 
key is to consider the following property, which a graph may or may not have: 


(*) Given finitely many distinct vertices U,,...,Um,V1,-.-,Un, there exists 
a verter z which is adjacent to uy,...,Um and nonadjacent to v1,...,Un- 


Often I will say, for brevity, “z is correctly joined”. Obviously, a graph 
satisfying (*) is infinite, since z is distinct from all of w1,...,Um,U1,---;Un- 
It is not obvious that any graph has this property. The theorem follows from 
two facts: 


Fact 1. With probability 1, a countable random graph satisfies (x). 
Fact 2. Any two countable graphs satisfying (*) are isomorphic. 


Proof (of Fact 1). We have to show that the event that (*) fails has 
probability 0, i.e., the set of graphs not satisfying (*) is a null set. For 
this, it is enough to show that the set of graphs for which («) fails for 
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some given vertices U1,...,Um,U1,---;Un is null. (For this deduction, we 
use an elementary lemma from measure theory: the union of countably 
many null sets is null. There are only countably many values of m and 
mn, and for each pair of values, only countably many choices of the vertices 
U1,-+-,Um),U1,-++,Un-) Now we can calculate the probability of this set. Let 
Z1,-.-,2n be vertices distinct from uj,...,Um,U1,---;Un- The probability 
that any z; is not correctly joined is 1 — oT} since these events are 
independent (for different z;), the probability that none of z,...,zn is 
correctly joined is (1 — 544)“. This tends to 0 as N — oo; so the event 
that no vertex is correctly joined does have probability 0. 


Note that, at this stage, we know that graphs satisfying (*) exist, though 
we have not constructed one—a typical “probabilistic existence proof”. Note 
also that “probability 4” is not essential to the proof; the same result holds 
if edges are chosen with fixed probability p, where 0 < p < 1. Some variation 
in the edge probability can also be permitted. 


Proof (of Fact 2). Let T, and Tz be two countable graphs satisfying (+). 
Suppose that f is a map from a finite set {21,...,2,} of vertices of Ty to T2, 
which is an isomorphism of induced subgraphs, and x,4+1 is another vertex of 
T,. We show that f can be extended to 2,41. Let U be the set of neighbours 
of p41 within {x1,...,a,}, and V = {a1,...,2,}\U. A potential image of 
XLn+1 Must be a vertex of [2 adjacent to every vertex in f(U) and nonadjacent 
to every vertex in f(V). Now property («) (for the graph Iz) guarantees that 
such a vertex exists. 

Now we use a model-theoretic device called “back-and-forth”. (This is 
often attributed to Cantor [20], in his characterization of the rationals as 
countable dense ordered set without endpoints. However, as Plotkin [58] has 
shown, it was not used by Cantor; it was discovered by Huntington [44] and 
popularized by Hausdorff [35].) 

Enumerate the vertices of Ty and T2, as {x1,22,...$ and {y1, yo,...} 
respectively. We build finite isomorphisms f, as follows. Start with fo = 9. 
Suppose that f,, has been constructed. If n is even, let m be the smallest 
index of a vertex of Ty not in the domain of f,,; then extend f,, (as above) 
to a map fr+1 with x, in its domain. (To avoid the use of the Axiom of 
Choice, select the correctly-joined vertex of [2 with smallest index to be the 
image of tm.) If n is odd, we work backwards. Let m be the smallest index 
of a vertex of [2 which is not in the range of f,; extend f, to a map fn41 
with ym in its range (using property («) for T;). 

Take f to be the union of all these partial maps. By going alternately 
back and forth, we guaranteed that every vertex of ['; is in the domain, and 
every vertex of Ig is in the range, of f. So f is the required isomorphism. 


The graph R holds as central a position in graph theory as Q does in 
the theory of ordered sets. It is surprising that it was not discovered long 
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before the 1960s! Since then, its importance has grown rapidly, both in its 
own right, and as a prototype for other theories. 


Remark 1. Results of Shelah and Spencer [65] and Hrushovski [42] suggest 
that there are interesting countable graphs which “control” the first-order 
theory of finite random graphs whose edge-probabilities tend to zero in 
specified ways. See Wagner [76], Winkler [77] for surveys of this. 


2. Some Constructions 


Erdos and Rényi did not feel it necessary to give an explicit construction of 
R; the fact that almost all countable graphs are isomorphic to R guarantees 
its existence. Nevertheless, such constructions may tell us more about R. Of 
course, to show that we have constructed R, it is necessary and sufficient to 
verify condition (*). 

I begin with an example from set theory. The downward Lowenheim- 
Skolem theorem says that a consistent first-order theory over a countable 
language has a countable model. In particular, there is a countable model of 
set theory (the Skolem paradox). 


Theorem 2. Let M be a countable model of set theory. Define a graph M* 
by the rule that x ~ y if and only if either x € y ory € x. Then M* is 
isomorphic to R. 


Proof. Let u1,...,;Um,V1,---,Un be distinct elements of M. Let « = {v1,..., 
Un} and z = {u1,...,Um,v}. We claim that z is a witness to condition (x). 
Clearly u; ~ z for all 7. Suppose that vu; ~ z. If vu; € z, then either vj = 
u; (contrary to assumption), or v; = x (whence x € 2, contradicting the 
Axiom of Foundation). If z € vj, then « € z € v; € x, again contradicting 
Foundation. 


Note how little set theory was actually used: only our ability to gather 
finitely many elements into a set (a consequence of the Empty Set, Pairing 
and Union Axioms) and the Axiom of Foundation. In particular, the Axiom 
of Infinity is not required. Now there is a familiar way to encode finite subsets 
of N as natural numbers: the set {a1,...,@,} of distinct elements is encoded 
as 2% +---+2%. This leads to an explicit description of R: the vertex set is 
N; x and y are adjacent if the x” digit in the base 2 expansion of y is a 1 or 
vice versa. This description was given by Rado [59, 60]. 

The next construction is more number-theoretic. Take as vertices the set 
P of primes congruent to 1 (mod 4). By quadratic reciprocity, if p,q € P, 


then (*) = 1 if and only if (2) = 1. (Here A(z) = 1” means that p is a 
quadratic residue (mod q).) We declare p and q adjacent if (2) =1. 
Let uy,,...,Um,V1,;---;Un € P. Choose a fixed quadratic residue a; 


(mod u,;) (for example, a; = 1), and a fixed non-residue b; (mod v;). By 
the Chinese Remainder Theorem, the congruences 
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cx=1 (mod4), c=a; (modu), x=b; (mod v,), 


have a unique solution x = zo (mod 4u1...Umv1...Un). By Dirichlet’s 
Theorem, there is a prime z satisfying this congruence. So property (*) holds. 

A set S of positive integers is called universal if, given k © N and T C 
{1,...,k}, there is an integer N such that, fori =1,...,k, 


N+7€S ifandonly if 7€T. 


(It is often convenient to consider binary sequences instead of sets. There is 
an obvious bijection, under which the sequence o and the set S correspond 
when (0; = 1) = (i € S)—thus @ is the characteristic function of S. Now 
a binary sequence o is universal if and only if it contains every finite binary 
sequence as a consecutive subsequence. ) 

Let S$ be a universal set. Define a graph with vertex set Z, in which x 
and y are adjacent if and only if |2 — y| € S. This graph is isomorphic to R. 
For let u1,...,Um,V1,---;Un be distinct integers; let | and L be the least and 
greatest of these integers. Let k = L—Il+1l and T = {u;-I4+1:i=1,...,m}. 
Choose N as in the definition of universality. Then z = /—1-— N has the 
required adjacencies. 

The simplest construction of a universal sequence is to enumerate all 
finite binary sequences and concatenate them. But there are many others. It 
is straightforward to show that a random subset of N (obtained by choosing 
positive integers independently with probability 5) is almost surely universal. 
(Said otherwise, the base 2 expansion of almost every real number in [0, 1] is 
a universal sequence.) 

Of course, it is possible to construct a graph satisfying (*) directly. For 
example, let [9 be the empty graph; if Ty, has been constructed, let [x41 
be obtained by adding, for each subset U of the vertex set of ,, a vertex 
z(U) whose neighbour set is precisely U. Clearly, the union of this sequence 
of graphs satisfies (*). 


3. Indestructibility 


The graph R is remarkably stable: if small changes are made to it, the 
resulting graph is still isomorphic to R. Some of these results depend on 
the following analogue of property (*), which appears stronger but is an 
immediate consequence of (*) itself. 


Proposition 1. Let uj,...,Um,V1,---,;Un be distinct vertices of R. Then 
the set 


Lae Se be fon SL wes Me oe Jorg = Tyan} 


is infinite; and the induced subgraph on this set is isomorphic to R. 
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Proof. It is enough to verify property («) for Z. So let uj,..., uh, U,,---5U; 
be distinct vertices of Z. Now the vertex z adjacent to w1,...,Un,U},.-.,U, 
and not to U1,...,Un,U},---,U;, belongs to Z and witnesses the truth of this 
instance of (*) there. 


The operation of switching a graph with respect to a set X of vertices 
is defined as follows. Replace each edge between a vertex of X and a vertex 
of its complement by a non-edge, and each such non-edge by an edge; leave 
the adjacencies within X or outside X unaltered. See Seidel [64] for more 
properties of this operation. 


Proposition 2. The result of any of the following operations on R is 
isomorphic to R: 


(a) Deleting a finite number of vertices; 
(b) Changing a finite number of edges into non-edges or vice versa; 
(c) Switching with respect to a finite set of vertices. 


Proof. In cases (a) and (b), to verify an instance of property (*), we use 
Proposition 1 to avoid the vertices which have been tampered with. For (c), if 
U ={ui,...,Um} and V = {v1,..., Un}, we choose a vertex outside X which 
is adjacent (in R) to the vertices of U \ X and VM X, and non-adjacent to 
those of UN X and V\ X. 


Not every graph obtained from R by switching is isomorphic to R. For 
example, if we switch with respect to the neighbours of a vertex z, then x is 
an isolated vertex in the resulting graph. However, if x is deleted, we obtain 
R once again! Moreover, if we switch with respect to a random set of vertices, 
the result is almost certainly isomorphic to R. 

R satisfies the pigeonhole principle: 


Proposition 3. If the vertex set of R is partitioned into a finite number of 
parts, then the induced subgraph on one of these parts is isomorphic to R. 


Proof. Suppose that the conclusion is false for the partition X,; U...U Xz 
of the vertex set. Then, for each i, property (*) fails in X;, so there are 
finite disjoint subsets U;,V; of X; such that no vertex of X; is “correctly 
joined” to all vertices of U;, and to none of V;. Setting U = U, U...UU, and 
V=V,U...UVz, we find that condition (x) fails in R for the sets U and V, 
a contradiction. 


Indeed, this property is characteristic: 


Proposition 4. The only countable graphs 1 which have the property that, 
if the vertex set is partitioned into two parts, then one of those parts induces 
a subgraph isomorphic to T, are the complete and null graphs and R. 


Proof. Suppose that T has this property but is not complete or null. Since 
any graph can be partitioned into a null graph and a graph with no isolated 
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vertices, we see that I has no isolated vertices. Similarly, it has no vertices 
joined to all others. 

Now suppose that [ is not isomorphic to R. Then we can find uj1,...,Um 
and v1,..., Un, such that («) fails, with m+n minimal subject to this. By the 
preceding paragraph, m+n > 1. So the set {uy,...,v,} can be partitioned 
into two non-empty subsets A and B. Now let X consist of A together with 
all vertices (not in B) which are not “correctly joined” to the vertices in 
A; and let Y consist of B together with all vertices (not in X) which are 
not “correctly joined” to the vertices in B. By assumption, X and Y form 
a partition of the vertex set. Moreover, the induced subgraphs on X and Y 
fail instances of condition (*) with fewer than m+n vertices; by minimality, 
neither is isomorphic to I, a contradiction. 


Finally: 
Proposition 5. R is isomorphic to its complement. 


For property (*) is clearly self-complementary. 


4. Graph-Theoretic Properties 


The most important property of R (and the reason for Rado’s interest) is 
that it is universal: 


Proposition 6. Every finite or countable graph can be embedded as an 
induced subgraph of R. 


Proof. We apply the proof technique of Fact 2; but, instead of back-and- 
forth, we just “go forth”. Let [ have vertex set {21,22,...}, and suppose 
that we have a map fn : {@1,.--,2n} — R which is an isomorphism of 
induced subgraphs. Let U and V be the sets of neighbours and non-neighbours 
respectively of 41 in {21,...,%,}. Choose z € R adjacent to the vertices 
of f(U) and nonadjacent to those of f(V), and extend f,, to map %p41 to z. 
The resulting map fn+1 is still an isomorphism of induced subgraphs. Then 
f =U fn is the required embedding. (The point is that, going forth, we only 
require that property (*) holds in the target graph.) 


In particular, R contains infinite cliques and cocliques. Clearly no finite 
clique or coclique can be maximal. There do exist infinite maximal cliques 
and cocliques. For example, if we enumerate the vertices of R as {x1,22,...}, 
and build a set S by So = 0,Sn41 = Sn U {xm} where m is the least index 
of a vertex joined to every vertex in S,, and S =) S;,, then S is a maximal 
clique. 

Dual to the concept of induced subgraph is that of spanning subgraph, 
using all the vertices and some of the edges. Not every countable graph is a 
spanning subgraph of R (for example, the complete graph is not). We have 
the following characterization: 
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Proposition 7. A countable graph T is isomorphic to a spanning subgraph 
of R if and only if, given any finite set {v1,...,Un} of vertices of T, there is 
a vertex z joined to none of v1,...,Un- 


Proof. We use back-and-forth to construct a bijection between the vertex 
sets of T and R, but when going back from R to I, we only require that 
nonadjacencies should be preserved. 


This shows, in particular, that every infinite locally finite graph is 
a spanning subgraph (so R contains 1-factors, one- and two-way infinite 
Hamiltonian paths, etc.). But more can be said. 

The argument can be modified to show that, given any non-null locally 
finite graph T, any edge of R lies in a spanning subgraph isomorphic to I. 
Moreover, as in the last section, if the edges of a locally finite graph are 
deleted from R, the result is still isomorphic to R. Now let T),T2,... be 
given non-null locally finite countable graphs. Enumerate the edges of R, as 
{e1, e2,...}. Suppose that we have found edge-disjoint spanning subgraphs 
of R isomorphic to Ty,...,0,. Let m be the smallest index of an edge of R 
lying in none of these subgraphs. Then we can find a spanning subgraph of 
R-—(l,U---UT,,) containing e,, and isomorphic to T,41. We conclude: 


Proposition 8. The edge set of R can be partitioned into spanning subgraphs 
isomorphic to any given countable sequence of non-null countable locally finite 
graphs. 


In particular, R has a 1-factorization, and a partition into Hamiltonian 
paths. 


5. Homogeneity and Categoricity 


We come now to two model-theoretic properties of R. These illustrate 
two important general theorems, the Engeler—Ryll-Nardzewski—Svenonius 
theorem and Fraissé’s theorem. The context is first-order logic; so a structure 
is a set equipped with a collection of relations, functions and constants whose 
names are specified in the language. If there are no functions or constants, we 
have a relational structure. The significance is that any subset of a relational 
structure carries an induced substructure. (In general, a substructure must 
contain the constants and be closed with respect to the functions.) 

Let M be a relational structure. We say that M is homogeneous if every 
isomorphism between finite induced substructures of M can be extended to 
an automorphism of M. 


Proposition 9. R is homogeneous. 


Proof. In the proof of Fact 2, the back-and-forth machine can be started with 
any given isomorphism between finite substructures of the graphs TI; and Lo, 
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and extends it to an isomorphism between the two structures. Now, taking 
IT, and [, to be R gives the conclusion. 


Fraissé [30] observed that Q (as an ordered set) is homogeneous, and 
used this as a prototype: he gave a necessary and sufficient condition for the 
existence of a homogeneous structure with prescribed finite substructures. 
Following his terminology, the age of a structure M is the class of all 
finite structures embeddable in M. A class C of finite structures has the 
amalgamation property if, given A, Bj, By € C and embeddings f; : A> B, 
and fz : A > Bo, there exists C € C and embeddings g; : B, — C and 
g2 : By > C such that fig: = foge. (Less formally, if the two structures 
B,, Bz have isomorphic substructures A, they can be “glued together” so 
that the copies of A coincide, the resulting structure C' also belonging to the 
class C.) We allow A = @ here. 


Theorem 3. (a) A class C of finite structures (over a fixed relational 
language)is the age of a countable homogeneous structure M if and 
only if C is closed under isomorphism, closed under taking induced 
substructures, contains only countably many non-isomorphic structures, 
and has the amalgamation property. 

(b) If the conditions of (a) are satisfied, then the structure M is unique up 
to isomorphism. 


A class C having the properties of this theorem is called a Fraissé class, 
and the countable homogeneous structure M whose age is C is its Fraissé 
limit. The class of all finite graphs is Fraissé class; its Fraissé limit is R. The 
Fraissé limit of a class C is characterized by a condition generalizing property 
(x): If A and B are members of the age of M with A C B and |B| = |A| +1, 
then every embedding of A into M can be extended to an embedding of B 
into M. 

In the statement of the amalgamation property, when the two structures 
By, Bz are “glued together”, the overlap may be larger than A. We say 
that the class C has the strong amalgamation property if this doesn’t occur; 
formally, if the embeddings g1,g2 can be chosen so that, if big, = bege, 
then there exists a € A such that 6; = af; and bg = afg. This property is 
equivalent to others we have met. 


Proposition 10. Let M be the Fraissé limit of the class C, and G = 

Aut(M). Then the following are equivalent: 

(a) C has the strong amalgamation property; 

(b) M\ A=M for any finite subset A of M; 

(c) The orbits of Ga on M \ A are infinite for any finite subset A of M, 
where G', is the setwise stabiliser of A. 


See Cameron [10], El-Zahar and Sauer [24]. 


The Random Graph 361 


A structure M is called Xo-categorical if any countable structure satisfying 
the same first-order sentences as M is isomorphic to M. (We must specify 
countability here: the upward Lowenheim-Skolem theorem shows that, if 
is infinite, then there are structures of arbitrarily large cardinality which 
satisfy the same first-order sentences as M..) 


Proposition 11. R is Xo-categorical. 


Proof. Property (*) is not first-order as it stands, but it can be translated 
into a countable set of first-order sentences Om,» (for m,n € N), where om.n 
is the sentence 


eran (EME A gem U Le ta) 


Once again this is an instance of a more general result. An n-type in 
a structure M is an equivalence class of n-tuples, where two tuples are 
equivalent if they satisfy the same (n-variable) first-order formulae. Now 
the following theorem was proved by Engeler [25], Ryll-Nardzewski [61] and 
Svenonius [68]: 


Theorem 4. For a countable first-order structure M, the following condi- 
tions are equivalent: 


(a) M is Xo-categorical; 

(b) M has only finitely many n-types, for every n; 

(c) The automorphism group of M has only finitely many orbits on M™, for 
every Nn. 


Note that the equivalence of conditions (a) (axiomatizability) and (c) 
(symmetry) is in the spirit of Klein’s Erlanger Programm. The fact that 


R satisfies (c) is a consequence of its homogeneity, since (71,...,%,) and 
(Y1,---,Yn) lie in the same orbit of Aut(R) if and only if the map (2; > y;) 
(i = 1,...,n) is an isomorphism of induced subgraphs, and there are only 


finitely many n-vertex graphs. 


Remark 2. The general definition of an n-type in first-order logic is more 
complicated than the one given here: roughly, it is a maximal set of n-variable 
formulae consistent with a given theory. I have used the fact that, in an No- 
categorical structure, any n-type is realized (i.e., satisfied by some tuple)— 
this is a consequence of the Géddel-Henkin completeness theorem and the 
downward Lowenheim-Skolem theorem. See Hodges [39] for more details. 


Some properties of R can be deduced from either its homogeneity or 
its No-categoricity. For example, Proposition 6 generalizes. We say that a 
countable relational structure M is universal (or rich for its age, in Fraissé’s 
terminology [31]) if every countable structure N whose age is contained in 
that of M (i.e., which is younger than M) is embeddable in M. 


Theorem 5. If M is either No-categorical or homogeneous, then it is 
universal. 
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The proof for homogeneous structures follows that of Proposition 6, 
using the analogue of property (*) described above. The argument for Nog- 
categorical structures is a bit more subtle, using Theorem 5.4 and Konig’s 
Infinity Lemma: see Cameron [11]. 


6. First-Order Theory of Random Graphs 


The graph R controls the first-order theory of finite random graphs, in a 
manner I now describe. This theory is due to Glebskii et al. [34], Fagin [29], 
and Blass and Harary [4]. A property P holds in almost all finite random 
graphs if the proportion of N-vertex graphs which satisfy P tends to 1 as 
N — oo. Recall the sentences om, which axiomatize R. 


Theorem 6. Let 6 be a first-order sentence in the language of graph theory. 
Then the following are equivalent: 


(a) 0 holds in almost all finite random graphs; 
(b) 6 holds in the graph R; 
(c) 6 is a logical consequence of {Om m :m,n € N}. 


Proof. The equivalence of (b) and (c) is immediate from the Gédel—Henkin 
completeness theorem for first-order logic and the fact that the sentences 
Om,n axiomatize R. 

We show that (c) implies (a). First we show that om,» holds in almost all 
finite random graphs. The probability that it fails in an N-vertex graph is not 
greater than N™+"(1— =457)N~-™"™, since there are at most N+” ways of 
choosing m+n distinct points, and (1— som)” ~™=—" is the probability that 
no further point is correctly joined. This probability tends to 0 as N > co. 

Now let 6 be an arbitrary sentence satisfying (c). Since proofs in first-order 
logic are finite, the deduction of 6 involves only a finite set % of sentences 
Om,n- It follows from the last paragraph that almost all finite graphs satisfy 
the sentences in ©; so almost all satisfy @ too. 

Finally, we show that not (c) implies not (a). If (c) fails, then 6 doesn’t 
hold in R, so (=@) holds in R, so (8) is a logical consequence of the 
sentences Om,n. By the preceding paragraph, (0) holds in almost all random 
graphs. 


The last part of the argument shows that there is a zero-one law: 


Corollary 1. Let 6 be a sentence in the language of graph theory. Then 
either 6 holds in almost all finite random graphs, or it holds in almost none. 


It should be stressed that, striking though this result is, most interesting 
graph properties (connectedness, hamiltonicity, etc.) are not first-order, and 
most interesting results on finite random graphs are obtained by letting the 
probability of an edge tend to zero in a specified manner as N — ov, rather 
than keeping it constant (see Bollobas [5]). Nevertheless, we will see a recent 
application of Theorem 6 later. 
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7. Measure and Category 


When the existence of an infinite object can be proved by a probabilistic 
argument (as we did with R in Sect. 1), it is often the case that an alternative 
argument using the concept of Baire category can be found. In this section, I 
will sketch the tools briefly. See Oxtoby [55] for a discussion of measure and 
Baire category. 

In a topological space, a set is dense if it meets every nonempty open set; 
a set is residual if it contains a countable intersection of open dense sets. The 
Baire category theorem states: 


Theorem 7. In a complete metric space, any residual set is non-empty. 


(The analogous statement for probability is that a set which contains a 
countable intersection of sets of measure 1 is non-empty. We used this to 
prove Fact 1.) 

The simplest situation concerns the space 2N of all infinite sequences of 
zeros and ones. This is a probability space, with the “coin-tossing measure” — 
this was the basis of our earlier discussion—and also a complete metric space, 
where we define d(x,y) = s¢ if the sequences x and y agree in positions 
0,1,...,2 — 1 and disagree in position n. Now the topological concepts 
translate into combinatorial ones as follows. A set S of sequences is open 
if and only if it is finitely determined, i.e., any x € S has a finite initial 
segment such that all sequences with this initial segment are in S. A set S$ 
is dense if and only if it is always reachable, i.e., any finite sequence has a 
continuation lying in S. Now it is a simple exercise to prove the Baire category 
theorem for this space, and indeed to show that a residual set is dense and 
has cardinality 28°. We will say that “almost all sequences have property P 
(in the sense of Baire category)” if the set of sequences which have property 
P is residual. 

We can describe countable graphs by binary sequences: take a fixed 
enumeration of the 2-element sets of vertices, and regard the sequence as 
the characteristic function of the edge set of the graph. This gives meaning 
to the phrase “almost all graphs (in the sense of Baire category)”. Now, by 
analogy with Fact 1, we have: 


Fact 3. Almost all countable graphs (in the sense of either measure or Baire 
category) have property (x). 


The proof is an easy exercise. In fact, it is simpler for Baire category than 
for measure—no limit is required! 

In the same way, almost all binary sequences (in either sense) are universal 
(as defined in Sect. 2). 

A binary sequence defines a path in the binary tree of countable height, 
if we start at the root and interpret 0 and 1 as instructions to take the left or 
right branch at any node. More generally, given any countable tree, the set of 
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paths is a complete metric space, where we define the distance between two 
paths to be oa if they first split apart at level n in the tree. So the concept 
of Baire category is applicable. The combinatorial interpretation of open and 
dense sets is similar to that given for the binary case. 

For example, the age of a countable relational structure M can be 
described by a tree: nodes at level n are structures in the age which have point 
set {0,1,...,n —1}, and nodes Xy, X41 at levels n and n+ 1 are declared 
to be adjacent if the induced structure of X,41 on the set {0,1,...,2—1} 
is X,. A path in this tree uniquely describes a structure N on the natural 
numbers which is younger than M, and conversely. Now Fact 3 generalizes 
as follows: 


Proposition 12. If M is a countable homogeneous relational structure, then 
almost all countable structures younger than M are isomorphic to M. 


It is possible to formulate analogous concepts in the measure-theoretic 
framework, though with more difficulty. But the results are not so straight- 
forward. For example, almost all finite triangle-free graphs are bipartite (a 
result of Erdés, Kleitman and Rothschild [26]); so the “random countable 
triangle-free graph” is almost surely bipartite. (In fact, it is almost surely 
isomorphic to the “random countable bipartite graph”, obtained by taking 
two disjoint countable sets and selecting edges between them at random.) 

A structure which satisfies the conclusion of Proposition 12 is called 
ubiquitous (or sometimes ubiquitous in category, if we want to distinguish 
measure-theoretic or other forms of ubiquity). Thus the random graph is 
ubiquitous in both measure and category. See Bankston and Ruitenberg [2] 
for further discussion. 


8. The Automorphism Group 


8.1. General Properties 


From the homogeneity of R (Proposition 9), we see that it has a large 
and rich group of automorphisms: the automorphism group G = Aut(R) 
acts transitively on the vertices, edges, non-edges, etc.—indeed, on finite 
configurations of any given isomorphism type. In the language of permutation 
groups, it is a rank 3 permutation group on the vertex set, since it has three 
orbits on ordered pairs of vertices, viz., equal, adjacent and non-adjacent 
pairs. Much more is known about G; this section will be the longest so far. 
First, the cardinality: 


Proposition 13. | Aut(R)| = 2*°. 


This is a special case of a more general fact. The automorphism group of 
any countable first-order structure is either at most countable or of cardinality 
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2X0, the first alternative holding if and only if the stabilizer of some finite 
tuple of points is the identity. 
The normal subgroup structure was settled by Truss [71]: 


Theorem 8. Aut(R) is simple. 


Truss proved a stronger result: if g and h are two non-identity elements 
of Aut(R), then h can be expressed as a product of five conjugates of g 
or g-'. (This clearly implies simplicity.) Recently Macpherson and Tent 
[49] gave a different proof of simplicity which applies in more general 
situations. 

Truss also described the cycle structures of all elements of Aut(R). 

A countable structure M is said to have the small index property if any 
subgroup of Aut(/) with index less than 2*° contains the pointwise stabilizer 
of a finite set of points of /; it has the strong small index property if any 
such subgroup lies between the pointwise and setwise stabilizer of a finite set 
(see Truss [72]). Hodges et al. [40] and Cameron [13] showed: 


Theorem 9. R has the strong small index property. 


The significance of this appears in the next subsection. It is also related 
to the question of the reconstruction of a structure from its automorphism 
group. For example, Theorem 9 has the following consequence: 


Corollary 2. LetT be a graph with fewer than 2®° vertices, on which Aut(R) 
acts transitively on vertices, edges and non-edges. Then T is isomorphic to 
R (and the isomorphism respects the action of Aut(R)). 


8.2. Topology 


The symmetric group Sym(X) on an infinite set X has a natural topology, 
in which a neighbourhood basis of the identity is given by the pointwise 
stabilizers of finite tuples. In the case where X is countable, this topology is 
derived from a complete metric, as follows. Take X =N. 

Let m(g) be the smallest point moved by the permutation g. Take the 
distance between the identity and g to be max{27™(9), a—-m(a~")1, Finally, 
the metric is translation-invariant, so that d(f,g) = d(fg~+,1). 


Proposition 14. Let G be a subgroup of the symmetric group on a countable 
set X. Then the following are equivalent: 


(a) G is closed in Sym(X); 
(b) G is the automorphism group of a first-order structure on X ; 
(c) G is the automorphism group of a homogeneous relational structure on X. 


So automorphism groups of homogeneous relational structures such as R 
are themselves topological groups whose topology is derived from a complete 
metric. 
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In particular, the Baire category theorem applies to groups like Aut(R). 
So we can ask: is there a “typical” automorphism? Truss [73] showed the 
following result. 


Theorem 10. There is a conjugacy class which is residual in Aut(R). Its 
members have infinitely many cycles of each finite length, and no infinite 
cycles. 


Members of the residual conjugacy class (which is, of course, unique) 
are called generic automorphisms of R. I outline the argument. Each of the 
following sets of automorphisms is residual: 


(a) Those with no infinite cycles; 

(b) Those automorphisms g with the property that, if [ is any finite graph 
and f any isomorphism between subgraphs of IT, then there is an 
embedding of I into R in such a way that g extends f. 


(Here (a) holds because the set of automorphisms for which the first n points 
lie in finite cycles is open and dense.) In fact, (b) can be strengthened; we 
can require that, if the pair (I, f) extends the pair ((, fo) (in the obvious 
sense), then any embedding of Ip into R such that g extends fo can be 
extended to an embedding of T such that g extends f. Then a residual set of 
automorphisms satisfy both (a) and the strengthened (b); this is the required 
conjugacy class. 

Another way of expressing this result is to consider the class C of finite 
structures each of which is a graph I’ with an isomorphism f between 
two induced subgraphs (regarded as a binary relation). This class satisfies 
Fraissé’s hypotheses, and so has a Fraissé limit M. It is not hard to show 
that, as a graph, Mis the random graph R; arguing as above, the map f can 
be shown to be a (generic) automorphism of R. 

More generally, Hodges et al. [40] showed that there exist “generic n- 
tuples” of automorphisms of R, and used this to prove the small index 
property for R; see also Hrushovski [41]. The group generated by a generic 
n-tuple of automorphisms is, not surprisingly, a free group; all its orbits are 
finite. In the next subsection, we turn to some very different subgroups. 

To conclude this section, we revisit the strong small index property. Recall 
that a neighbourhood basis for the identity consists of the pointwise stabilisers 
of finite sets. If the strong small index property holds, then every subgroup 
of small index (less than 2*°) contains one of these, and so is open. So we can 
take the subgroups of small index as a neighbourhood basis of the identity. 
So we have the following reconstruction result: 


Proposition 15. If M is a countable structure with the strong small index 
property (for example, R), then the structure of Aut(M) as topological group 
is determined by its abstract group structure. 
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8.3. Subgroups 


Another field of study concerns small subgroups. To introduce this, we 
reinterpret the last construction of R in Sect.2. Recall that we took a 
universal set S C N, and showed that the graph ['(S) with vertex set Z, 
in which z and y are adjacent whenever |z — y| € S, is isomorphic to R. Now 
this graph admits the “shift” automorphism x +> x + 1, which permutes the 
vertices in a single cycle. Conversely, let g be a cyclic automorphism of R. 
We can index the vertices of R by integers so that g is the map rH «+1. 
Then, if S = {n € N: n~ 0}, we see that x ~ y if and only if |jx—y| € S, and 
that S' is universal. A short calculation shows that two cyclic automorphisms 
are conjugate in Aut(R) if and only if they give rise to the same set S. Since 
there are 2*° universal sets, we conclude: 


Proposition 16. R has 2°° non-conjugate cyclic automorphisms. 


(Note that this gives another proof of Proposition 13.) 

Almost all subsets of N are universal—this is true in either sense discussed 
in Sect.7. The construction preceding Proposition 16 shows that graphs 
admitting a given cyclic automorphism correspond to subsets of N; so almost 
all “cyclic graphs” are isomorphic to R. What if the cyclic permutation is 
replaced by an arbitrary permutation or permutation group? The general 
answer is unknown: 


Conjecture 1. Given a permutation group G on a countable set, the 
following are equivalent: 


(a) Some G-invariant graph is isomorphic to R; 
(b) A random G-invariant graph is isomorphic to R with positive probability. 


A random G-invariant graph is obtained by listing the orbits of G on 
the 2-subsets of the vertex set, and deciding randomly whether the pairs 
in each orbit are edges or not. We cannot replace “positive probability” by 
“probability 1” here. For example, consider a permutation with one fixed 
point xz and two infinite cycles. With probability s, x is joined to all or none 
of the other vertices; if this occurs, the graph is not isomorphic to R. However, 
almost all graphs for which this event does not occur are isomorphic to R. 
It can be shown that the conjecture is true for the group generated by a 
single permutation; and Truss’ list of cycle structures of automorphisms can 
be re-derived in this way. 

Another interesting class consists of the regular permutation groups. 
A group is regular if it is transitive and the stabilizer of a point is the identity. 
Such a group G can be considered to act on itself by right multiplication. 
Then any G-invariant graph is a Cayley graph for G; in other words, there is 
a subset S of G, closed under inverses and not containing the identity, so that 
x and y are adjacent if and only if zy~! € S$. Now we can choose a random 
Cayley graph for G by putting inverse pairs into S with probability $. It is 
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not true that, for every countable group G, a random Cayley graph for G 
is almost surely isomorphic to R. Necessary and sufficient conditions can be 
given; they are somewhat untidy. I will state here a fairly general sufficient 
condition. 

A square-root set in G is a set 


Ja={teG:2? =a}; 
it is principal if a = 1, and non-principal otherwise. 


Proposition 17. Suppose that the countable group G cannot be expressed as 
the union of finitely many translates of non-principal square-root sets and a 
finite set. Then almost all Cayley graphs for G are isomorphic to R. 


This proposition is true in the sense of Baire category as well. In the 
infinite cyclic group, a square-root set has cardinality at most 1; so the earlier 
result about cyclic automorphisms follows. See Cameron and Johnson [15] for 
further details. 


8.4. Overgroups 


There are a number of interesting overgroups of Aut(R) in the symmetric 
group on the vertex set X of R. 

Pride of place goes to the reducts, the overgroups which are closed in the 
topology on Sym(X) (that is, which are automorphism groups of relational 
structures which can be defined from R without parameters). These were 
classified by Simon Thomas [69]. 

An anti-automorphism of R is an isomorphism from R to its complement; 
a switching automorphism maps R to a graph equivalent to R by switching. 
The concept of a switching anti-automorphism should be clear. 


Theorem 11. There are exactly five reducts of R, viz.: A = Aut(R); the 
group D of automorphisms and anti-automorphisms of R; the group S' of 
switching automorphisms of R; the group B of switching automorphisms and 
anti-automorphisms of R; and the symmetric group. 


Remark 3. The set of all graphs on a given vertex set is a Zo-vector space, 
where the sum of two graphs is obtained by taking the symmetric difference 
of their edge sets. Now complementation corresponds to adding the complete 
graph, and switching to adding a complete bipartite graph. Thus, it follows 
from Theorem 11 that, if G is a closed supergroup of Aut(R), then the set of 
all images of R under G is contained in a coset of a subspace W(G) of this 
vector space. (For example, W(B) consists of all complete bipartite graphs 
and all unions of at most two complete graphs.) Moreover, these subspaces are 
invariant under the symmetric group. It is remarkable that the combinatorial 
proof leads to this algebraic conclusion. 
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Here is an application due to Cameron and Martins [18], which draws 
together several threads from earlier sections. Though it is a result about 
finite random graphs, the graph R is inextricably involved in the proof. 

Let F be a finite collection of finite graphs. For any graph T, let F(T) 
be the hypergraph whose vertices are those of [, and whose edges are the 
subsets which induce graphs in F. To what extent does F(T’) determine I? 


Theorem 12. Given F, one of the following possibilities holds for almost 
all finite random graphs T: 


(a) F(T) determines T uniquely; 

(b) F(L) determines T up to complementation; 

(c) F(T) determines T up to switching; 

(d) F(T) determines T up to switching and/or complementation; 


(e) F(T) determines only the number of vertices of T. 


I sketch the proof in the first case, that in which F is not closed 
under either complementation or switching. We distinguish two first-order 
languages, that of graphs and that of hypergraphs (with relations of the 
arities appropriate for the graphs in F). Any sentence in the hypergraph 
language can be “translated” into the graph language, by replacing “F is an 
edge” by “the induced subgraph on FE is one of the graphs in F”. 

By the case assumption and Theorem 11, we have Aut(7(R)) = Aut(R). 
Now by Theorem 4, the edges and non-edges in R are 2-types in F(R), so 
there is a formula ¢(z, y) (in the hypergraph language) such that x ~ y in R 
if and only if ¢(a,y) holds in F(R). If ¢* is the “translation” of ¢, then R 
satisfies the sentence 


(Var, y)((x ~ y)  $*(z,y))- 


By Theorem 6, this sentence holds in almost all finite graphs. Thus, in almost 
all finite graphs, I’, vertices x and y are joined if and only if ¢(x, y) holds in 
F(T). So F(L) determines [ uniquely. 

By Theorem 11, Aut(#(R)) must be one of the five possibilities listed; 
in each case, an argument like the one just given shows that the appropriate 
conclusion holds. 

There are many interesting overgroups of Aut(R) which are not closed, 
some of which are surveyed (and their inclusions determined) in a forthcoming 
paper of Cameron et al. [16]. These arise in one of two ways. 

First, we can take automorphism groups of non-relational structures, such 
as hypergraphs with infinite hyperedges (for example, take the hyperedges to 
be the subsets of the vertex set which induce subgraphs isomorphic to R), or 
topologies or filters (discussed in the next section). Second, we may weaken 
the notion of automorphism. For example, we have a chain of subgroups 


Aut(R) < Auti(R) < Aute(R) < Auts(R) < Sym(V(R)) 


with all inclusions proper, where 
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e Auti(R) is the set of permutations which change only finitely many 
adjacencies (such permutations are called almost automorphisms of R); 

e Aut2(R) is the set of permutations which change only finitely many 
adjacencies at any vertex of R; 

e Auts(R) is the set of permutations which change only finitely many 
adjacencies at all but finitely many vertices of R. 


All these groups are highly transitive, that is, given any two n-tuples 
(V1,---,;Un) and (wi,...,Wn) of distinct vertices, there is an element of 
the relevant group carrying the first tuple to the second. This follows from 
Aut;(R) by the indestructibility of R. If Ry and Rg are the graphs obtained 
by deleting all edges within {v),..., un} and within {w1,..., wn} respectively, 
then R; and Rg are both isomorphic to R. By homogeneity of R, there is 
an isomorphism from R, to Ro mapping (v1,...,Un) to (wi,..., Wn); clearly 
this map is an almost-automorphism of R. 

Indeed, any overgroup of R which is not a reduct preserves no non-trivial 
relational structure, and so must be highly transitive. 


9. Topological Aspects 


There is a natural way to define a topology on the vertex set of R: we 
take as a basis for the open sets the set of all finite intersections of vertex 
neighbourhoods. It can be shown that this topology is homeomorphic to Q 
(using the characterization of Q as the unique countable, totally disconnected, 
topological space without isolated points, due to Sierpitiiski [66], see also 
Neumann [54]). Thus: 


Proposition 18. Aut(R) is a subgroup of the homeomorphism group of Q. 
This is related to a theorem of Mekler [50]: 


Theorem 13. A countable permutation group G is embeddable in the 
homeomorphism group of Q if and only if the intersection of the supports 
of any finite number of elements of G is empty or infinite. 


Here, the support of a permutation is the set of points it doesn’t fix. Now 
of course Aut(R) is not countable; yet it does satisfy Mekler’s condition. 
(If x is moved by each of the automorphisms g1,...,gn, then the infinitely 
many vertices joined to x but to none of xg1,..., gn are also moved by these 
permutations. ) 

The embedding in Proposition 18 can be realised constructively: the 
topology can be defined directly from the graph. Take a basis for the open 
sets to be the sets of witnesses for our defining property (*); that is, sets of 
the form 


Z(U,V) ={z € V(R): Vue U)(zn~u)A(WEV)(z £v)} 
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for finite disjoint sets U and V. Now given u # v, there is a point 
z € Z({u}, {v}); so the open neighbourhood of z is open and closed in the 
topology and contains u but not v. So the topology is totally disconnected. It 
has no isolated points, so it is homeomorphic to Q, by Sierpinski’s Theoreom. 
There is another interesting topology on the vertex set of R, which can be 
defined in three different ways. Let B be the “random bipartite graph”, the 
graph with vertex set X UY where X and Y are countable and disjoint, where 
edges between X and Y are chosen randomly. (A simple modification of the 
Erd6és—Rényi argument shows that there is a unique graph which occurs with 
probability 1.) Now consider the following topologies on a countable set X: 


T: point set V(R), sub-basic open sets are open vertex neighbourhoods. 
T*: points set V(R), sub-basic open sets are closed vertex neighbourhoods. 
Tt: points are one bipartite block in B, sub-basic open sets are neighbour- 
hoods of vertices in the other bipartite block. 


Proposition 19. (a) The three topologies defined above are all homeomor- 
phic. 
(b) The homeomorphism groups of these topologies are highly transitive. 


Note that the topologies are homeomorphic but not identical. For 
example, the identity map is a continuous bijection from 7* to 7, but is 
not a homeomorphism. 


10. Some Other Structures 


10.1. General Results 


As we have seen, R has several properties of a general kind: for example, 
homogeneity, No-categoricity, universality, ubiquity. Much effort has gone into 
studying, and if possible characterizing, structures of other kinds with these 
properties. (For example, they are all shared by the ordered set Q.) 

Note that, of the four properties listed, the first two each imply the third, 
and the first implies the fourth. Moreover, a homogeneous structure over a 
finite relational language is No-categorical, since there are only finitely many 
isomorphism types of n-element structure for each n. Thus, homogeneity is 
in practice the strongest condition, most likely to lead to characterizations. 

A major result of Lachlan and Woodrow [48] determines the countable 
homogeneous graphs. The graphs H,, in this theorem are so-called because 
they were first constructed by Henson [36]. 


Theorem 14. A countable homogeneous graph is isomorphic to one of the 
following: 
(a) The disjoining union of m complete graphs of size n, where m,n < No 


and at least one of m and n is Xo; 


(b) Complements of (a); 
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(c) The Fraissé limit H,, of the class of K,,-free graphs, for fixed n > 3; 
(d) Complements of (c); 
(e) The random graph R. 


The result of Macpherson and Tent [49] shows that the automorphism 
groups of the Henson graphs are simple. It follows from Proposition 15 
that Aut(R) is not isomorphic to Aut(H,,). Herwig [38] showed that these 
automorphism groups are all pairwise non-isomorphic; see also Barbina and 
Macpherson [3]. 

Other classes in which the homogeneous structures have been deter- 
mined include finite graphs (Gardiner [32]), tournaments (Lachlan [47|— 
surprisingly, there are just three), digraphs (Cherlin [21] (there are un- 
countably many, see Henson [37]), posets (Schmer] [62]) and permutations 
(Cameron{12]). In the case of posets, Droste [23] has characterizations under 
weaker assumptions. 

For a number of structures, properties of the automorphism group, 
such as normal subgroups, small index property, or existence of generic 
automorphisms, have been established. 

A theorem of Cameron [8] determines the reducts of Aut(Q): 


Theorem 15. There are just five closed permutation groups containing the 
group Aut(Q) of order-preserving permutations of Q, viz.: Aut(Q); the group 
of order preserving or reversing permutations; the group of permutations 
preserving a cyclic order; the group of permutations preserving or reversing 
a cyclic order; and Sym(Q). 


However, there is no analogue of Theorem 12 in this case, since there is 
no Glebskii-Blass—-Fagin—Harary theory for ordered sets. (Q is dense; this is 
a first-order property, but no finite ordered set is dense.) 

Simon Thomas [70] has determined the reducts of the random k-uniform 
hypergraph for all k. 

Since my paper with Paul Erdés concerns sum-free sets (Cameron and 
Erdos [14]), it is appropriate to discuss their relevance here. Let H,, be the 
Fraissé limit of the class of K,,-free graphs, for n > 3 (see Theorem 14). These 
graphs were first constructed by Henson [36], who also showed that H3 admits 
cyclic automorphisms but H,, does not for n > 3. We have seen how a subset 
S of N gives rise to a graph I'(S) admitting a cyclic automorphism: the vertex 
set is Z, and x ~ y if and only if |x — y| € S. Now ['(S) is triangle-free if and 
only if S is sum-free (i.e., z, yE S>at+y ¢ 8S). It can be shown that, for 
almost all sum-free sets S (in the sense of Baire category), the graph T'($) 
is isomorphic to H3; so H3 has 2*° non-conjugate cyclic automorphisms. 
However, the analogue of this statement for measure is false; and, indeed, 
random sum-free sets have a rich and surprising structure which is not well 
understood (Cameron [9]). For example, the probability that I'(S) is bipartite 
is approximately 0.218. It is conjectured that a random sum-free set 5S almost 
never satisfies ['(S') = H3. In this direction, Schoen [63] has shown that, if 
T(S) = Hs, then S has density zero. 
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The Henson K,,-free graphs H,,, being homogeneous, are ubiquitous in 
the sense of Baire category: for example, the set of graphs isomorphic to H3 
is residual in the set of triangle-free graphs on a given countable vertex set 
(so H3 is ubiquitous, in the sense defined earlier). However, until recently, 
no measure-theoretic analogue was known. We saw after Proposition 12 
that a random triangle-free graph is almost surely bipartite! However, 
Petrov and Vershik [57] recently managed to construct an exchangeable 
measure on graphs on a given countable vertex set which is concentrated on 
Henson’s graph. More recently, Ackerman, Freer and Patel [1] showed that 
the construction works much more generally: the necessary and sufficient 
condition turns out to be the strong amalgamation property, which we 
discussed in Sect. 5. 

Universality of a structure M was defined in a somewhat introverted way 
in Sect.5: M is universal if every structure younger than M is embeddable 
in M. A more general definition would start with a class C of structures, 
and say that M € C is universal for C if every member of C embeds into 
M. For a survey on this sort of universality, for various classes of graphs, see 
Komjath and Pach [46]. Two early negative results, for the classes of locally 
finite graphs and of planar graphs, are due to De Bruijn (see Rado [59]) and 
Pach [56] respectively. 


10.2. The Urysohn Space 


A remarkable example of a homogeneous structure is the celebrated Urysohn 
space, whose construction predates Fraissé’s work by more than two decades. 
Urysohn’s paper [74] was published posthumously, following his drowning in 
the Bay of Biscay at the age of 26 on his first visit to western Europe (one 
of the most romantic stories in mathematics). An exposition of the Urysohn 
space is given by Vershik [75]. 

The Urysohn space is a complete separable metric space U which is 
universal (every finite metric space is isometrically embeddable in U) and 
homogeneous (any isometry between finite subsets can be extended to an 
isometry of the whole space). Since U is uncountable, it is not strictly covered 
by the Fraissé theory, but one can proceed as follows. The set of finite 
rational metric spaces (those with all distances rational) is a Fraissé class; the 
restriction to countable distances ensures that there are only countably many 
non-isomorphic members. Its Fraissé limit is the so-called rational Urysohn 
space Ug. Now the Urysohn space is the completion of Ug. 

Other interesting homogeneous metric spaces can be constructed simi- 
larly, by restricting the values of the metric in the finite spaces. For example, 
we can take integral distances, and obtain the integral Urysohn space Uz. We 
can also take distances from the set {0,1,2,...,4} and obtain a countable 
homogeneous metric space with these distances. For k = 2, we obtain 
precisely the path metric of the random graph R. (Property (*) guarantees 
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that, given two points at distance 2, there is a point at distance 1 from both; 
so, defining two points to be adjacent if they are at distance 1, we obtain a 
graph whose path metric is the given metric. It is easily seen that this graph 
is isomorphic to R.) 

Note that R occurs in many different ways as a reduct of Ug. Split the 
positive rationals into two dense subsets A and B, and let two points v, w be 
adjacent if d(v,w) € A; the graph we obtain is R. 

A study of the isometry group of the Urysohn space, similar to that done 
for R, was given by Cameron and Vershik [19]. The automorphism group 
is not simple, since the isometries which move every point by a bounded 
distance form a non-trivial normal subgroup. 


10.3. KPT Theory 


TI conclude with a brief discussion of a dramatic development at the interface 
of homogeneous structures, Ramsey theory, and topological dynamics. 

The first intimation of such a connection was pointed out by Nesetfil [51]; 
in [52] he suggested using this connection to characterize Ramsey classes, 
see also [43]. We use the notation CG) for the set of all substructures of A 
isomorphic to B. A class C of finite structures is a Ramsey class if, given a 
natural number r and a pair A, B of structures in C, there exists a structure 
C €C such that, if ( is partitioned into r classes, then there is an element 


B' € (§) for which @) is contained in a single class. In other words, if we 
colour the A-substructures of C with r colours, then there is a B-substructure 
of C, all of whose A-substructures belong to the same class. Ramsey’s classical 
theorem asserts that the class of finite sets is a Ramsey class. 


Theorem 16. A hereditary isomorphism-closed Ramsey class is a Fraissé 
class. 


There are simple examples which show that a good theory of Ramsey 
classes can only be obtained by making the objects rigid. The simplest way 
to do this is to require that a total order is part of the structure. Note that, 
if a Fraissé class has the strong amalgamation property, than we may adjoin 
to it a total order (independent of the rest of the structure) to obtain a new 
Fraissé class. We refer to ordered structures in this situation. Now the theorem 
above suggests a procedure for finding Ramsey classes: take a Fraissé class 
of ordered structures and test the Ramsey property. A number of Ramsey 
classes, old and new, arise in this way: ordered graphs, K’,-free graphs, metric 
spaces, etc. Indeed, if we take an ordered set and “order” it as above to obtain 
a set with two orderings, we obtain the class of permutation patterns, which 
is also a Ramsey class: see Cameron [7], B6ttcher and Foniok [13], Sokié [67] 
and, for a more general result, Bodirsky [6]. 

The third vertex of the triangle was quite unexpected. 
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A flow is a continuous action of a topological group G on a topological 
space X, usually assumed to be a compact Hausdorff space. A topological 
group G admits a unique minimal flow, or universal minimal continuous 
action on a compact space X. (Here minimal means that X has no non- 
empty proper closed G-invariant subspace, and universal means that it can 
be mapped onto any minimal G-flow.) 

The group G is said to be extremely amenable if its minimal flow consists 
of a single point. 

The theorem of Kechris, Pestov and Todorcevic [45] asserts: 


Theorem 17. Let X be a countable set, andG a closed subgroup of Sym(X). 
Then G is extremely amenable if and only if it is the automorphism group of 
a homogeneous structure whose age is a Ramsey class of ordered structures. 


As a simple example, the theorem shows that Aut(Q) (the group of order- 
preserving permutations of Q is extremely amenable (a result of Pestov). 

The fact that the two conditions are equivalent allows information to 
be transferred in both directions between combinatorics and topological 
dynamics. In particular, known Ramsey classes such as ordered graphs, 
ordered K,-free graphs, ordered metric spaces, and permutation patterns 
give examples of extremely amenable groups. 

The theorem can also be used in determining the minimal flows for various 
closed subgroups of Sym(X). For example, the minimal flow for Sym(X) is 
the set of all total orderings of X (a result of Glasner and Weiss [33]). 


Acknowledgements I am grateful to J. Schmerl and the editors for helpful 
comments. 
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1. An Apology 


Paul Erdés has published more than one hundred research papers in set 
theory. It is my rough estimate that these contain more than one thousand 
theorems, many having an interest in their own right. Although most of his 
problems and results have a combinatorial flavour, and the subject now known 
as “combinatorial set theory” is one he helped to create, it is also true to say 
that his work has had a very important impact upon the direction of research 
in many parts of present day set theory. Whole theories have developed out 
of basic questions which he formulated. 

This (relatively) short note does not, and is not intended to, give a 
methodical survey of set theory or combinatorial set theory or even of Erdés’ 
work in set theory. I shall simply write about some of the ideas as I learned 
them during our cooperation over many years, some of the highlights, and 
some of the outstanding results. In many cases it will not be possible to give 
a detailed discussion of the present day status of some of the problems I shall 
mention. If the reader considers that my own name occurs too frequently in 
this note, I can only offer the excuse that we have published more than 50 
joint papers, mostly in set theory, and I probably know these papers better 
than the rest of his work. 


2. Early Days and Some Philosophy 


Paul was a child mathematical prodigy, and he started to discover outstand- 
ing original results in number theory as a first year undergraduate. We are 
familiar with the names of mathematicians who influenced his early work in 
number theory and analysis, Pal Veress, Fejér, Davenport, Mordell to mention 
just a few. But this is not true for set theory. Paul told me that he learned the 
basics of set theory from his father, a well educated high-school teacher, and 
he soon became fascinated with “Cantor’s paradise”. However, he discovered 
set theory as a subject for research by himself. 
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Paul, who has always refrained from seriously formulating any kind of 
philosophy, was (and still is) the ultimate Platonist. X.,..,,41 existed for him 
just as surely as 3, the smallest odd prime. He was driven by the same 
compulsive search for “truth” whether he was thinking about inaccessible 
cardinals or twin primes. Moreover, he could switch from one subject to the 
other in an instant. All questions which admit a relevant answer in finite 
combinatorics should also be asked and answered in set theory, and vice- 
versa. A large part of his greatness lies in the fact that he really did find the 
relevant questions. It was this attitude which led him to his first encounter 
with (actual) infinity. 

In 1931, as a first year undergraduate student attending the graph theory 
course of Dénes Konig, he proved a generalization of Menger’s theorem for 
infinite graphs. This only appeared in 1936 at the end of Konig’s book 
on graph theory. In 1936 he wrote a paper jointly with Tibor Gallai and 
Endre Vazsonyi having a similar character; they gave necessary and sufficient 
conditions for an infinite graph to have an Euler line [1, 2]. 

The next paper I have to mention [ESz], about finite combinatorics, was 
written with George Szekeres in 1935. They rediscovered the finite version 
of Ramsey’s theorem and proved a fundamental Ramsey-type result of finite 
character: If G is a graph having ure vertices (k,£ > 3), then either 
G contains a complete graph on k vertices or there is an independent set 
of & vertices. From then on he always had in mind possible generalizations 
of Ramsey’s theorem, and so became the creator of both finite and infinite 
Ramsey theory. 


3. Infinite Ramsey Theory: Early Papers 


The Ramsey theorem is about partitions of (finite) k-element subsets of w (the 
set of non-negative integers), and in the mid-1930s Erdés began to speculate 
about partitioning the countable sets. He corresponded with Richard Rado in 
Cambridge, England about this problem and Rado proved the first theorem 
saying that “nothing can be said in this case”. The result appeared only much 
later, in the early 1950s, in a sequence of joint papers by them [9, 12, 13, 18]. 
I will return to this later. 

Erdés’ first real set-theoretic result appeared in a paper of Dushnik and 
Miller [DM]. The theorem, now known as the Erdés; Dushnik, Miller theorem, 
says that for an infinite cardinal x, if a graph on K vertices does not contain an 
infinite complete subgraph, then there is an independent set of vertices of size k. 
This was the first “unbalanced” generalization of Ramsey’s theorem. Once the 
result is formulated, the verification for regular « is a fairly easy exercise. Erd6és 
proved it for singular «, and his proof, which required a good technical knowledge 
of the set theory of those days, is included in the Dushnik-Miller paper. 

Soon after, in 1942, he proved in [3] the basic theorems of infinite Ramsey 
theory. Let [X]" denote the set of r-element subsets of X, and let f : [X]" > 7 
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be an r-partition of y colours on X. A set H C X is homogeneous for f in 
the colour v < y if f(Y) = v for all Y € [H]". More than 10 years later in 
joint work with Paul, [12], Richard Rado introduced the partition symbol 


KR Ov)rey 


to denote the following assertion: for any r-partition f : [K]" > y there are 
v<y and H Cx such that |H| = A, and H is homogeneous for colour v. 
The negation of this is denoted by replacing the arrow by a crossed arrow, 
~. Tf A, = A for all vy < 7, the notation « + (A)* is used; this is called the 
“balanced” partition symbol. 

Using this notation, Ramsey’s theorem states 


w—(w), forl<rk<w. 
The Erdés; Dushnik, Miller theorem says, for any infinite cardinal x, 
K — (K, No)”. 
The result of Rado I did not state says, for every «, 
I Pe (No)3°. 


Using these notations, the results proved by Erdés in the 1942 paper are the 
following: 


(i) (2)* + (A4)2. 
(ii) 24 ~ (3)%. 
(iii) Assuming the generalized continuum hypothesis (GCH in what follows) 

Rate = (Nata, Roai)?. 

(iv) 24 -» (At)2. 

He attributes (iv) to Sierpinski, who proved it for 4 = No. He also mentions 
that the obvious ~ relation (ii) was pointed out to him by Gédel in 
conversation. 

Of course, the partition symbol was not used in that early paper. In fact 
Erd6és was always slightly resistant to its use. Later, when forced, he did 
sometimes write the symbol, but I have never seen him read it. When we 
were discussing such relations he frequently asked me in a complaining voice 
to “state it in human language”. 

The observant reader would already have noted that (iii) was an attempt 
to find the right generalization of the Erdés; Dushnik, Miller theorem. We 
will return to this topic later. 


4. His “Remarks” 


As yet we are still in 1943, and in that year two more significant papers 
appeared. First I want to say a few words about [5], “Some remarks on Set 
Theory”. I quote the first sentence: “This paper contains a few disconnected 
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results on the theory of sets.” The “Remarks” became a series. Eleven 
of them appeared in all. The fifth and sixth were written jointly with 
Géza Fodor, the seventh to ninth and eleventh with me, and the tenth 
with Michael Makkai. Several of these papers contain set-theoretical results 
about Euclidean spaces, Hamel bases and other objects that are familiar 
to analysts and combinatorialists. The editors and I have decided that this 
typical Erd6és genre deserves a separate treatment, and this will be given by 
Péter Komjath in this volume. However, I cannot resist mentioning the first 
theorem in the first of these papers, the Erddés-Sierpinski duality principle. 
This generalization of an earlier result of Sierpirski states: Assuming the 
continuum hypothesis (CH) there is a surjective map f : R > R which 
interchanges sets of Lebesgue measure zero and sets of first category. 

Stating this theorem allows me an opportunity to say something about 
his attitude towards the generalized continuum hypothesis (GCH) and 
mathematical logic in general. It should be remembered that in 1943 Godal’s 
proof of the consistency of GCH was quite new. Erdds always knew and 
appreciated and applied these results. He was happy to have them as a more- 
or-less justified tool to prove new theorems, and if he could not solve a set 
theory problem he always tried to solve it assuming GCH. On the other 
hand, later on he was always uneasy and disappointed if one of his favourite 
problems turned out to be independent, and he would remark “independence 
has raised its ugly head”. 


5. Large Cardinals: The Erdos-Tarski Paper 


The cardinal «, has the property P(«) if there is a field of sets which 
contains a family of A pairwise disjoint sets for every \ < &, but which 
does not contain such a family of size «. Much to their surprise, Erddés 
and Tarski [4] proved that for limit cardinals «, P(«) holds if and only if 
&, is an uncountable inaccessible. First of all it was surprising that such a 
seemingly harmless problem should involve inaccessible cardinals which, in 
those days, had “hardly been born”. The second surprise was, and this is 
explicitly mentioned in the paper, that the negation of P(2*°) could not 
be proved in ZFC since it was generally believed that it would be proved 
consistent that 2®° is inaccessible. (Indeed, this was one of the first corollaries 
of Cohen’s method.) 

In their paper, they formulated several properties of inaccessible cardinals 
and they mentioned quite a few connections between these properties in 
footnotes (without proofs). For example, they knew that if « is measurable 
then it has the tree property and that this implies that « — («)} holds for 
all A < « andr < w. Of course, they also knew that the simplest Ramsey- 
type theorem « — («)3 is false if « is not strongly inaccessible. Later, in 
1960 when Tarski, using a result of Hanf, proved that “small” inaccessibles 
are not measurable, and the theory of large cardinals was created, it became 
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necessary to publish these classical proofs. A new Erdés-Tarski paper (written 
by Donald Monk) appeared in 1961. 

It is of interest to note that Erddés and Tarski made a historical 
mistake in [4]. It was vaguely speculated that it may turn out to be at 
least consistent that all strongly inaccessible cardinals are measurable. This 
probably postponed the discovery of the true situation for almost 20 years. 
I cannot say how seriously Tarski believed this (I did try to ask him), but 
Erdés quite happily accepted this hypothesis in the spirit I described in §2. 
In our joint work from 1956 to 1960 we investigated every combinatorial 
property of strongly inaccessible cardinals under the assumption that they 
are measurable, although we did always mention that this was an assumption. 
However, as was so often the case for Paul, in the end this turned out to be 
quite fortunate. I will come back to this in 89 and §10. 


6. Set Mappings and Compactness 


Erdos visited Hungary in 1948 for the first time after the Second World War. 
Very likely it was during this visit that he recalled an old problem of Paul 
Turan. Let f be a set mapping on a set X, ie. f : X — P(X) (the power 
set of X) such that x ¢ f(x). We say that f is of order X if |f(a)| < A for 
x € X. A subset S C X is independent for f if for all z,y € S, y ¢ f(a). For 
combinatorialists, a set mapping of type \ is just a loop-free digraph having 
out degrees < A. Turdn was interested in the case when X = R and f(z) is 
finite, ie., f is of order w, and he asked if there is a free set of power 2*°. 
A young Hungarian, Dezs6 Lazar who was killed during the war, proved this 
and he also proved that if f is of order 7X < K > w then there is a free set 
of size « provided & is a regular cardinal. Ruziewicz conjectured that this is 
true for any kK > w. 

Erdés proved this conjecture assuming GCH in the second of his 
“Remarks” in 1950 [10]. It remained an open question for ten more years 
if GCH is really needed. In 1960, I proved the result in ZFC in [H 1] where 
more history of the problem can be found. 

Typically, even before proving the result he conjectured that if \ is an 
infinite cardinal and f is a set mapping of order \ on any set X, then X is the 
union of A independent sets, i.e. the digraph has chromatic number at most A. 
This was proved by Géza Fodor [F]. Erdés investigated the problem for finite 
A in a paper with N.G. de Bruijn [11]. A little reflection will convince the 
reader that if the underlying set is finite then it is the union 2A—1 independent 
sets (and this is best possible). To show that this is true for arbitrary X they 
proved that for any k < w if every finite subgraph of a graph G has chromatic 
number of at most k then G also has chromatic number at most k. 

The reader may say that this is a consequence of either Tychonov’s 
theorem on the product of compact spaces or Gédel’ s compactness theorem, 
but this is how compactness was introduced to infinite combinatorics. Let me 
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point out that it would be quite difficult to find a proof of the set mapping 
theorem not using compactness. I do not know of any. 

Erdos continued the investigation of set mappings with Géza Fodor in 
[19] and [21]. I want to mention one of their theorems, which later proved to 
be useful in applications. 

Assume f is a set mapping of order’ <K>w onk. Let T < « and let 
Xa(a <7) be a sequence of subsets of & each of size K. There is a set S' free 
for f which meets each Xq in a set of size x. For singular « their proof used 
GCH, but my method yields this in ZFC as well. 


7. A Partition Calculus in Set Theory 


The partition calculus was developed by Erdos and Rado in the early 1950s. 
The long paper [18] contains all the results they had proved up until then. 
Their first discovery was that the partition relation k > (A, )/)<.,, made sense 
for order types as well as cardinals, or even a mixture of these. This led 
them to a great variety of new problems, some simple and some difficult, 
but requiring different methods. Let me mention just a few of these. For 
what countable ordinals a does a — (a,3)? hold? For what a < w; does 
A — (a)? hold, where k is finite and \ is the order type of the reals? They 
proved the pleasing result that 7 — (No,7)?, where 7 is the order type of 
the rationals, but for what other countable types is this true? They also 
noticed that the proof of the Erdés; Dushnik, Miller theorem in the special 
case K = Ww actually gives the slightly stronger fact that w; 4 (w1,w + 1)?, 
and then a natural question is, what about w; 4 (w1,w + 2)?? 

They proved a great many partial results and isolated the most important 
problems. We cannot collect all their results and problems here, instead I shall 
discuss some of the important new discoveries. One of these is the positive 
stepping up lemma which can be stated as follows: If & is a cardinal and 
K > (Av)bey holds, then (2<")+ + (A, +1)2E4. Here y+ denotes the smallest 
cardinal greater than py, and 2<" = S°{2" : yy < «a cardinal}. Let exp,,() 
denote the n-times iterated exponentiation (i.e. expg(&) = & and exp,,, 1 (K) = 
2exPn(*),) Since 2<*" = 2", and «+ > («*)! just expresses the fact that «+ 
is a regular cardinal, we obtain by induction the 


Erd6és-Rado Theorem: 
(exp, («))* > («t)R*?. 


In particular, we have (2")+ > (K+)2,(2?°)+ > (c+)3 ete. Quite often 
in the literature only the first of these (case n = 2) is referred to as the 
Erd6s-Rado theorem, but we have seen that this was already proved in 1943. 
There can be very few theorems in set theory which have received so many 
“simplified” proofs as the Erdés-Rado theorem. But in the early 1950s there 


was no pressing-down lemma, and elementary substructures and chains had 
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not been introduced into set theory. Erdés and Rado used the so-called 
ramification method. Let me outline this in a simple case. Let f : [X]? > 
be a 2-partition of X with y colours. Pick an xp € X. The rest of the points 
can be split into y parts according to the colour of f({xo, y}). Repeat this 
in each part and continue transfinitely. We get a tree or ramification system 
as they called it. At the a stage we will have |y|!“! parts. If X has large 
enough cardinality, the tree we get will have large height. The points picked 
along a branch of this tree will form a prehomogeneous set, i.e., the colour of 
a pair {%, xg} will depend only upon the point, say xq, which was chosen 
first. This method is fairly hard to write down formally, but it is really quite 
intuitive. It was elaborated in great detail in the Erdés-Rado papers and was 
used for several years to obtain positive partition relations for the case when 
the underlying set has regular cardinality. Although most of the important 
proofs have been streamlined to “linear” ones, there is really no algorithm 
for this translation , and the intuition behind ramification serves as a good 
tool to obtain new results. 
They also discovered polarized partition relations. The symbol 


()=(),. 
A Ay v<y 
means the following: whenever f : & x \ — v is a colouring with y colours, 
then there arev < y, K € [k]*” and L € [A]*” such that K x L is homogeneous 
for f in the colour v. For combinatorialists, this is just Ramsey for complete 
bipartite graphs, and the reader can easily formulate the generalization to 
s-partite graphs. 

However, this is not just formalism. It turned out that quite a few 


problems about polarized partitions are basic questions in set theory. As 
an illustration, they proved 


Ni ~ Ni No \*? 
Xo on a 


In “human language” this says: if Ag (a < w1) are arbitrary subsets of w, 
then either the intersection of Xi of them is infinite, or the union of Xo of 
them has an infinite complement. They attributed the negative relation 


RX, Ni No)" 
CHs jc ms & o | 
to Sierpinski who, of course, proved this in a different context. 

There is one more important partition relation I should mention, and this 
realized Paul’s old wish to have a Ramsey theorem for something more than 
just k-element sets. They introduced the symbol « — (A)5“ to denote the 
following statement: for every sequence fi, : [K|" > y of n-partitions of K with 


y colours, there is a subset H C # of cardinality X which is simultaneously 
homogeneous for each f,,. They only proved that 2° + (No)5% with a clever 
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ad-hoc construction, and they asked if K — (No)5“ can be true for any 
cardinal «? We turn back to the discussion of this important symbol in 89 
and 810. 

There is one more type of partition theorem which I should have 
mentioned earlier. In [9] they proved the first canonical Ramsey theorem, 
and this also led to a long sequence of investigations, improvements and 
generalizations. Just to give the flavour of what this is all about, I will state 
just one special case. Let f : [w]? > 7 be a 2-partition of w with any number 
of colours. Then there is an infinite subset H C w such that either H is 
homogeneous for some colour, or all pairs in H have different colours, or H 
is prehomogeneous (the colour of a pair depends only on the least element), 
or H is endhomogeneous (the colour of a pair depends only on the largest 
element). 


8. My First Encounter with Paul 


Erdés visited Hungary in 1955 for the first time since the country had become 
a member of the Eastern block. He was an Hungarian citizen traveling on an 
Hungarian passport, and he could not have returned earlier if he wanted to 
leave again. But in the “liberalized” atmosphere of 1956 the Academy was 
allowed to elect him as a member and the government granted him a special 
diplomatic type of passport which allowed him to come and go whenever he 
wished. This was a great opportunity for young Hungarian mathematicians 
who had heard of him only by name (of course, it was impossible for us to 
travel to the West before 1956). 

At that time I was a graduate student of Ldszl6 Kalmar in Szeged (a 
small town on the south-eastern border of Hungary). Paul travelled around 
the country in 1956 and came to visit the mathematics department at the 
University of Szeged. He had already corresponded with Géza Fodor who was 
then a young assistant professor in the department. I was introduced to him 
as “a promising young man” studying set theory, and soon we were left alone 
in Professor Kalmar’s office sitting in two enormous armchairs facing each 
other over a coffee table. I thought he was very old—he was 43 years old and 
I was 25. I felt very honoured, and a little embarrassed, to be left alone with 
this famous man. I did not know then that he had met most of his young 
collaborators in a similar way. He first asked me what were my interests in 
set theory. I was then writing my thesis on a subject which later was called 
relative constructibility, and I was quite proud of it. So I started to explain my 
results with some enthusiasm. He listened to me very politely, and when I had 
finished he asked “and are you interested in normal set theory as well?” Of 
course, we were not on first-name terms then and so the question was phrased 
in a very polite form of Hungarian that is used for addressing a stranger. But 
it was clear that it was a genuine inquiry and he meant no harm. 
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Earlier I had thought of a problem when I heard about all the set-mapping 
results from Géza, What if we investigate set-mappings of more variables, and 
I asked if there would be large free subsets in this case too? To tell the truth, 
as a student of Kalmar, I was trained to think like a logician. So what I had 
in mind was this: if X is a large set and to every finite subset V C X we 
associate a countable F(V) C X such that F(V)NV = 9, is there a large 
independent set? I even had the vague idea that if F(V) is the Skolem hull 
of V in some structure, then an “independent set” would really deserve its 
name. 

Luckily, Paul liked that one! It started a furious activity and the 
conversation became more fluent and colloquial. The first thing I learned 
from him (and this took quite a while) was that he would not start to 
think about the general case. He first wanted to know what happened for 
set mappings defined on pairs. He proved several lemmas and some partial 
results and stated a few conjectures. He then suddenly remembered that 
there was something else he had to do and he called Géza. Quite close to 
the mathematics building in Szeged stands a rather ugly cathedral built in 
the 1930s with two high towers. It turned out that he “must” climb the 300 
odd stairs to the top! Géza had earlier agreed to accompany him, and he 
then gently began to persuade me to come along too. I had by then lived for 
2 years in Szeged, and I had never had the slightest difficulty in resisting any 
pressure to visit the tower, especially since the surrounding countryside is 
absolutely flat and so there was not very much to see. However, much to my 
own surprise, I could not resist this invitation. Climbing those stairs more 
results and conjectures were formulated by him while, at the same time, he 
was complaining that he felt a little dizzy. 

That day ended with dinner at Kalmar’s house where the conversation 
continued mainly about set mappings, but sometimes interrupted with some 
of his comments on “Sam and Joe”. When we parted, it was almost as from 
an old friend—there was a joint-paper half ready, which could be completed 
by correspondence. 


9. Our First Joint Paper 


The notation used for discussing set-mapping problems is not standardized 
as in the case of partition relations. A set-mapping of order and type yz on 
« is a function f : [Kk] > [k]<* such that f(x) x = @ for all x € dom/(f), 
and a set S C « is free if f(z) NS = Q for all x € [S]#. I shall denote by 
Free(«, A, 4,1”) the following assertion: For every set-mapping of order X and 
type 4 on « there is a free set of cardinality v. Likewise, Free(K, A, < p,V) 
denotes the corresponding assertion when f : [K]<" — [K]<>. 

Our very first result was to prove that Free(exp,,_;(A)*,A,n, A?) holds 
for every infinite A. The proof of this uses the Erdés-Rado theorem which I 
had learned during my first conversation with Paul. I also learned that it was 
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not known if the Erdés-Rado theorem is best possible, for example it was not 
known then if 


gt (aye 


holds. Assuming GCH we could prove — Free(Nj, 2, 2,%1) so that our theorem 
is best possible for n = 2, but for n > 2 any progress seemed to lie far in the 
future. I will return to this in the section on the negative stepping-up method 
we developed with Rado, but let me say now that there are only consistency 
results to show that the theorem is best possible. 

I was therefore surprised when shortly afterwards I received a letter from 
Paul (who was visiting Israel) claiming that Free(«,A,n,«) holds for n < w 
and any uncountable limit cardinal x > . The proof used GCH and I realized 
that it only worked for singular « (the real theorem is for « a singular strong 
limit cardinal, i.e. 27 < « for T < &.) We now know that this is a special 
case of a general “canonization” theorem proved later with Rado, but which 
Paul discovered in at least two other interesting contexts before the general 
theorem was formulated. I wrote to tell him that I could not see how the 
proof works for regular limit «. He replied by return of post that I was right, 
but the theorem was true since we may use the “measure hypothesis” from 
Erdés-Tarski, and he wrote down a proof that Free(«, A, n, «) holds for finite n 
and  < «, an uncountable measurable cardinal. I have to say that during my 
studies I had read the Erdés-Tarski paper, but either I skipped the footnotes 
or I did not recognize the significance of the remarks. However, after reading 
the letter, I did understand the strength of the hypothesis, and the same day 
proved that it implies Free(«, A, < w,«). Later, when the paper was actually 
written, we realized that the proof actually gave the stronger result that 


> (K)x 
holds for A < « if K > w is a measurable. This is perhaps one of our best- 
known joint theorems, and I will say more about this in the next section. 
This brings me to our first joint oversight. Although our joint paper with 
Rado did not appear until 1965, I already had a weak form of the negative 
stepping-up lemma in 1957 (in fact it was because of this that we decided 
to write the triple paper on partition relations even though Erdos and Rado 
had already obtained a great many new unpublished results.) 
I told Paul that the negative stepping-up gives us that 


Kw (w)3% = 28 (w) 3°. 

He immediately pointed out that it is easy “to go through” singular 
cardinals, and we put into the paper the remark that Kk + (w)5“ holds 
for all «, less than the first strongly inaccessible cardinal kp > w. We only 
realized later [35], after we learned the Hanf-Tarski results, that this almost 
trivially implies that 


Ko % (wt 15%, 
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and therefore, by our theorem, it follows that #9 is not measurable. 
Unfortunately, this argument is not very strong, we could never make it 
work beyond the first fixed point in the sequence of inaccessibles. 


10. Erdés Cardinals and the Strength of « — (A)5“ 


The real strength of the statement « > (A)5“ was discovered by the Berkeley 
school in the early 1960s. Dana Scott first proved that the existence of a 
measurable cardinal contradicts Gédel’s axiom of constructibility (V = L). 
Soon afterwards Gaiffman and Rowbottom proved that it also implies that w 
has only countably many constructible sets, and more generally Rowbottom 
proved that « + (w)5* implies that w) is inaccessible in L. 

Rowbottom also generalized our theorem with Paul. According to the 
Kiesler-Tarski paper [KT] it was Dana Scott who introduced the notion of a 
normal measure (a K-complete 0-1 measure on « satisfying the pressing down 
lemma, i.e. if f is regressive on a subset of measure one, then it is constant 
on a set of measure one.) It was known that if « is measurable then it carries 
a normal measure, and Rowbottom proved that, if there is a normal measure 
on « and f : [K]” > A(n < w,A < &), then there is a homogeneous set of 
measure one. Of course, this immediately yields our theorem, and the proof 
actually becomes easier. 

I happened to spend 1964 at Berkeley with Tarski’s group and gave 
a course of lectures on Erddés-Rado set theory. I could not have been too 
successful as a lecturer, more than 20 people attended the first lecture, and 
in the end I was left with an audience of three—two students, Reinhardt 
and Silver, and a young assistant professor Donald Monk. I told them 
everything I knew about ordinary partition theorems and the little I knew 
about « — (A)5“. Silver apparently got interested and his thesis [Si], which 
appeared in 1966, contained some fantastic discoveries. 

First he realized that the real strength of « > (A)5“ is that it yields, for 
any given structure on «, a set of indiscernibles having order type A (i.e. a 
set of ordinals such that any two similarly ordered n-tuples satisfy the same 
formulas.) Using this he proved that the smallest « satisfying « + (w)5“ must 
be very large, for example there must be many weakly compact cardinals less 
than «. He showed that, for a < w, if K + (a)$“” holds, then it is true in L, 
and finally he proved that if k + (w1)$“ holds, then O* exists, which means 
that L is very small, and this expresses the real strength of « > (w,)5%. Iam 
not willing to write down the technicalities of this here. 

Let me remind the reader that one of the few consequences of the axiom 
of constructibility which G6del himself had noticed was that there is an 
uncountable analytic complement (the complement of a continuous image 
of a Borel set) which has no perfect subset. It was Solovay who proved that 
if k — (w1)5“ holds for some « then such a set cannot exist. This was 
the first application in descriptive set theory. It is stated in Descriptive Set 
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Theory, the 1978 book of Moschovakis, that all applications of the existence 
of measurable cardinals in descriptive set theory come from a & satisfying 
K —> (w1)5“. All this shows that such cardinals deserve a special name, and 
the story I have written down shows that they are quite rightly called Erd6és 
cardinals. 


11. The Early Sixties. A Long Chapter 


After 1956, Paul came home to visit his mother every year. He usually spent 
some months in Budapest where I also lived at that time. When he was at 
home I went to work with him at their apartment two or three times a week. 
Mrs. Erd6s was not only a devoted mother to Paul, but she was also an 
efficient secretary and would keep a record of his publications and look after 
his papers. When I visited them she would make us coffee and then leave 
us alone to “work”. Our meetings had no prepared agenda, sometimes we 
went through earlier proofs, sometimes we had to read a manuscript or proof 
sheets, but the main point of our conversations was always the discovery of 
new problems and to start thinking about them. Paul was fantastically fast 
in both making and understanding proofs and finding the new questions. 
Though I usually made some notes, they were never quite satisfactory. We 
both needed to rely on our memories. This was quite a good fit, he always 
remembered the theorems and then I could scrape together the old proofs. 
I think now that these visits were real highlights in my life. 

Now I have to change strategy. I cannot continue telling the results paper 
by paper, and in any case they were not proved in the order of publication. 
Starting around 1957 or 1958, we agreed to write a triple paper with Rado 
on the partition calculus and the three of us set aside everything which we 
thought belonged there. Already in 1960 I visited Rado in Reading to work 
on the triple paper, carrying with me an almost completed manuscript. So, 
in this long section I will open subsections about the results of these years 
with an indication of where they appeared. 


11.1. Canonization 


Let f : [X]" > y be an r partition of length y of X. Let (Yo : a < y) be 
a sequence of disjoint subsets of X , Y = Uj, Ya For a subset v € [Y]" 
there is a number s(v) <r an increasing sequence a(v) = (a;(v) : i < s(v)) 
of ordinals and a sequence r(v) = (rj(v) : 1 < s(v)) of integers, defining the 
position of v in the partition Y = eee Yq, so that Neicaty) ri(v) =r and 
lu Yo,(vy| = ri(v) for i < s(v). Two r-element sets v,v’ € [Y]" have the 
same position if a(v) = a(v’) and r(v) = r(v’). f is canonical with respect to 
the sequence (Ya: a < y) if for any two v,v’ having the same position 
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The “canonization” theorem of [43] tells us that: there is an integer k, such 
that whenever (Xa: @ < y) is a sequence of subsets of X with fast enough 
increasing cardinalities, 


|Xo| > expz, (| LU Xl), 
B<a 


then there is a disjointed sequence Y, C Xa(a < y) also with fast increasing 
cardinalities, |Ya| > |Ugeq Xa|, such that f : [x]" — y is canonical with 
respect to the sequence (Y, :a< y), provided y,p < |Xol. 

One corollary of this is the following. Assume & is a singular strong limit 
cardinal then & > («,A,)2<,, if and only if cf(K) — (cf(K),AL)22,. The 
‘only if? part comes easily using a canonical partition, and the ‘if’ part uses 
the canonization theorem. The reader should remember the Erddés; Dushnik, 
Miller theorem « — (k,No)?. Now the above result tells us, at least with 
GCH, for which singular cardinals « the relation «,— (K,%1)? holds. For 
example, Xu, ~ (Nw,,%1)? but Xu, > (Nu.,Ni)?. By now I do not really 
have to tell the reader that this is the form discovered by Paul. 

It would be nice to have a necessary and sufficient condition for the case of 
arbitrary singular «. We knew that for a singular x, say with cf(«) = (2%°)*, 
to have k, > (K,%1)? it is necessary to have \®° < « for \ < « We repeatedly 
asked if this is sufficient. It was proved by Shelah and Stanley in the 1980s 
that this is consistently false [SS1]. 

When preparing the material of our book with Attila Maté and Richard 
Rado [100] where we tried to discuss the ordinary partition relation is ZFC, 
we isolated the following problem. Assume there is an increasing sequence of 
integers n, : k < w such that 


Ry < 2Rmo cc Ome 


Does it follow that 2<®» — (X,,)3 holds? Clearly our canonization does 
not work in this case. Shelah proved this with a new type of canonization 
theorem [S1], and parts of his results are given in the book. Further uses of 
“canonization” will be mentioned later. 

One last remark. It is interesting to see how combinatorial ideas do pop 
up in different topics. When Shelah obtained with such miraculous speed his 
celebrated result on the bound in van der Waerden’s theorem, he was already 
the best expert on canonization, and one of the main lemmas in his proof is 
indeed a (finite) canonization theorem. 


11.2. Square Brackets 


Sierpiriski proved 2¥° + (N1)3 by well ordering the continuum and defining 
a partition of the pairs into two classes, so that a pair ordered in the same 
way in both the natural ordering and the well-ordering belongs to the first 
class. 
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Paul told me that he formulated a generalization of this in 1956 with the 
following question. Can one split the pairs of reals into three classes so that 
every subset of size X, (or 2°°) contains a pair from each class? 

He soon proved this assuming CH. We discovered that whenever a 
partition relation fails, one can ask for a corresponding weaker property, 
and in [43] we introduced the following square bracket relation 


as Dulp<y: 


This means that for every f : [k]" > 7 there isa v <y and a subset H Ck, 
|H| = X, so that f does not take the value v on the r-tuples of H. 

It is worthwhile to formulate separately the negation of the “balanced” 
form of this (when all the A, are equal). Thus « ~ [A] means that there is 
an f :[«&]" + 7 such that all subsets of size X are completely inhomogeneous 
i.e. f takes all possible values on the r-tuples of any set of size 4. 

Clearly we needed some test cases. We proved that 2" = «xt mplies 
K+ ~ [«t]?,, and only later did we realize that this was also known to 
Sierpiniski in a different context. But probably the nicest result was the 
following: 

If « is a strong limit cardinal of cofinality w, then K > [K]3. 

(Note that « + (K,(cf(K))T)2 and « ~ [«]3 is a trivial corollary.) This 
follows from the “canonization” theorem of the previous section. Indeed it 
gives a stronger result. Under the above conditions on «, for every f : [Kk]? > 
7, 7 <&, there is a set H € [«]" such that f takes at most two different 
values on the pairs of H. 

So we introduced a third symbol, the strong square bracket. Let y, 6 be 
cardinals. « — [A]! 5(« — [A]! -5) means that for every r-partition f : [«]” > 
y with y-colors, there is a subset H C « of size A such that f takes at most 
6 (fewer than 6) values on the r-tuples of H. 

So the above theorem says that k > [kK] 9 for singular strong limit « of 
cofinality w and 7 < &. We used this symbol to ask if Nz > [Nilg, x? Paul 
thought this was an old question of Ulam, but later we discovered that it is 
equivalent to a well-known model theoretical conjecture of C.C. Chang. 

In §12, I will discuss the effect of our 1967 problem paper [67], but this is 
a good place to write down the present status of some of the square bracket 
problems stated in that paper. Let me begin with an innocent but very nice 
result of Fred Galvin 


n> [nl3- 

Many generalizations of this were published later. Galvin and Shelah proved 
28o se [28o]s and cf(2¥°) -» [cf(2*°)]§, in 1968 [GS], they also proved some 
weak results like N;  [Ni]7 and 8, ~ [Ni], for the case when the underlying 
set has cardinality Ni. 

Again we had a false feeling. Although we did not state it explicitly, we 
clearly believed that X%; + dik, could not be proved in ZFC. But in 1987 
Stevo Todoréevié proved us wrong [T1]. He proved in ZFC that 
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+ 


KT oe [et]? 


Kr 

holds for every regular «. This was extended by Todoréevié and Shelah 
for more successors, and inaccessibles. This was certainly one of the most 
significant discoveries in set theory in the 1980s requiring entirely new 
methods. I will come back to this for a moment in the next section. However, 
this still leaves open the question whether 


aXe» [N]3 


holds? Shelah [S2] proved that 2%° — [Nj]? is really consistent with ZFC. In 
his model 2*° is quite large so it is still possible, but quite unlikely, that 


go =No Shor [Na ]3. 


11.3. Jé6nsson Algebras-Negative Relations with Infinite 
Exponents 


A Jénsson algebra is an infinite algebra A with countably many finitary 
operations such that all proper subalgebras have cardinality strictly less than 
JAI. The question is, for what infinite cardinals « is there a J6nsson algebra 
of cardinality «? I mention this here because of a connection with the square 
brackets. As pointed out by Shelah much later in the game, there is a Jonsson 
algebra on « if and only if « ~ [«]$” holds. 

I heard the problem from Tarski in 1964 and when I returned to Hungary 
and met Paul, we immediately had some remarks about this which we 
published in [45]. First we proved that if there is a Jénsson algebra on &, 
then there is also one on Kt, and hence there is one on N,, for n < w. We also 
proved that 2" = «t implies that there is a Jonsson algebra on Kt since we 
knew that 2° =Kt > «Kt » [k*]?,. 

I must also mention that it was already proved by Kiesler and Rowbottom 
that there is a Jonsson algebra on every « if V = L [KR]. 

It was a metatheorem for the two of us because of Rado’s theorem that 
“nothing is true for infinite exponents”. So we proved already in [24] that 
Free(k, 2, No, No) fails for every « and in [43] we strengthened Rado’s result 
tok + [No]d8, - In this spirit we also proved in the Jonsson algebra paper 
that there is an infinitary Jénsson algebra on every «& in other words 


K~* [K]R° 


This became one of our best used theorems. Kunen used it for a simple proof 
of his famous theorem that there is no nontrivial elementary embedding of the 
universe into itself (disproving the hoped for existence of Reinhardt cardinals) 
and Solovay used it in his proof that GCH holds at every singular strong limit 
cardinal above a strongly compact cardinal. 

Our “metatheorem” is not quite true since Foreman and Magidor [FM] 
recently proved that it is consistent that N3 — [No]? It goes without 
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saying that Erd6ds always assumed the axiom of choice, and I would not even 
mention this except that it happens that partition relations with infinite 
exponents may be true if we do not assume the axiom of choice, and 
indeed they became an important tool of set theory e.g. in investigations 
concern ing the Axiom of Determinacy and its consequences to descriptive 
set theory. 

Back to the previous section, Shelah [S5] recently published quite a few 
theorems extending the class of cardinals « for which there is a Jonsson 
algebra and then later proving the stronger result k ~+ [«]?. I presently 
do not know of any instance of the result where K+ ~ [K*]SY is true but 
Kt — [K*]?, is not known. 


11.4. Negative Stepping-Up 


This result published in [43] says that, if r > 2, « > w and & » (,,) 
then 


i 
v<y? 


2° (Ay + Le, 


provided the sequence X, satisfies certain simple conditions. The simplest of 
these is that two of them are infinite and one of them is regular. There are 
about six more conditions to cover relevant cases. These conditions become 
less restrictive as r grows, and there is no condition at all for r > 5. 

But even the one just stated tells us that the Erdés-Rado theorem of §7 
is best possible, i.e. 

eXPp_1() (67 )3 
for n > 2, since the result 2" + («*)3 can be lifted by induction on n. 

Let me state another example. We know that if « + (K*)3 then Kk + 
(k,4)?. This should imply 2" + («,5)4, but to get this, a special argument 
is needed say if & is singular. 

Maybe the negative stepping up is true without any conditions at all on 
the A,, but to the best of my knowledge this is still wide open. There are cases 
where we do not know what happens without GCH for n = 2. Let me explain 
this with an example. It is very easy to see that N®° ~ (Nw41, (No)x,)?, but 
this should still be true if the No entries are replaced by 3’s, and indeed we 
did prove this with GCH 


Not (Quaay (3)x0)?- 


To stick my neck out again, it seems inconceivable to prove this in ZFC, but 
no consistency proofs are known in the other direction. 

Now a trivial canonization lifts this say to the first singular cardinal with 
cofinality No41 ie. to Nosy, (Nuw41 (3)R,) and this should be stepped up 
to 


Roop tl nad Cee (4)xo)°. 
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Unfortunately in this case, for r = 2, only one of the entries is infinite and 
even that is singular. So we had nothing to cover this case and it was stated 
as one of our open problems for a long time. I thought Shelah and Stanley 
had a proof of this + from GCH, but I understand that it is still open. 

A more significant problem is that our result suggests a negative stepping- 
up, for square brackets and set mappings as well. 

It was recognized early in the game that for square brackets this is 
consistently false even assuming GCH. For example, 2° = Xi > XN, ~ [N ilk 
but it is very easy to see that Ny » [Ni]g, implies No  [NiJf, 9, the negation 
of Chang’s conjecture, which was proved to be consistent in an early paper 
of Silver. 

Stevo Todoréevié worked out stepping-up methods from combinatorial 
principles known to hold in L, which do give the stepping-up for square 
brackets and for set mappings in most cases. See [T2] and also [HK1] for 
more history. 

Let me conclude this section with two more interesting recent results 
of Todoréevié which show the present direction of research in this area. He 
proved that Xz > Rik, is equivalent to Chang’s conjecture in ZFC (without 
assuming CH), and Ny  [Ni]x, is true in ZFC [T3]. See also [57]. This is a 
very deep result, but Erdés had a hand in initiating of this type of theorem 
as well, in [81] we remarked that the stepping-up method yields 2®! ~ [X1]} 
in ZFC. 


11.5. Polarized Partition Relations 


While working on the triple paper [43], we had to draw the line somewhere, 
and we decided that we will only include results for polarized partitions of 


the form 
1,1 
K caf Ko Ai 
) bay 


and we gave a number of results assuming GCH. Some of the results inherent 
in the methods were only stated in the second problem paper [81]. But the 
simplest problem we isolated was if 


Xs e oat 
Qko = 
and i ~ (ae 


holds for No < p, v < Nj. 
One of the first results proved after our problem list became public was 
due to Karel Prikry [P]. I state a special case: 


(r) = Gn) ee) «ER, 
*” or even *» 
No Ni No Xt Ni 


is consistent with ZFC and GCH. 


396 Andras Hajnal 


Later, Richard Laver [L] proved that relative to a very large cardinal it is 
consistent with GCH that there is an w; complete ideal J on w; having the 
following strong saturation property: Given F Cc I+, the complement of J, 
|F'| = No (ie. N2 large subsets of Xi there is an F! C F, |F+| = No, such that 
the intersection of any countably many sets in F'! is in I+. This easily yields 


(SG), em Ge) > Ge), 
ven ‘ 
Ny Ny 2 Ny Ny Xo 


and it was one of the first corollaries of Jensen’s morasses, that Prikry’s result 
holds in L. 

For lack of space and time, we did not include polarized partitions in the 
book [100] so there is no comprehensive account in the literature about the recent 


results. Let me state one problem of the form Ca ) > ( . ) ‘Which is unsolved 
and for which there are no consistency results either. Does GCH imply 


CeCe), 
Rar Ror 2 


A small hope here is an unpublished remark of Shelah from 1989. Assume 
(Ka 1 @ < Wy) is an increasing sequence of measurable cardinals, K = sup, Ka 


and 2" = «Kt then 
Ae 1,1 
(“ ) = @ holds. 
kK ae 


Added in proof (March 1995). In September 1994, Shelah proved the 
following striking result. Assume & > cf(K), « is strong limit and 2" > Kt. 


Then 
a 1,1 
é ) = (") holds for T < &. 
kK KJ, 


Writing up the second problem paper [81], I realized that our theorem in [43] 


yielding () as ana from CH can be generalized to give 


No Ni 1,1 
9% = > 
(4) > Ga), 
but it is consistent with CH that 
>t), 
aa 
No Ny 4 


holds. A recent still unpublished result of J. Baumgartner using a new kind 
of argument, says that assuming CH (but no more of GCH) 


() > Ge), 
No N1/ ny 


holds. 
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11.6. Property B and Incompactness 


Our second major joint paper [33] is about the following property of families of 
sets, F': There is a set B which meets every element of F' but does not contain 
any member of F as a subset. This means, in a terminology introduced later, 
that the chromatic number of F' is two. Before stating some results I want to 
tell how we came across property B. Property B was actually discovered by 
Felix Bernstein in 1908. He proved that for every & > w, if F is a family of 
size « of sets of size « then F' has this property. (He used it to get a subset 
BCR, |B| = |R \ B| = 2"° and such that neither B nor R \ B contains a 
perfect subset of R. 

In those years I often visited Erdés at the summer house of the Academy 
in Matrahdza (a summer resort in the mountains), where he used to spend 
part of the summer with his mother. The place was reserved for members of 
the Academy and I was still young, so I had to find a place in the village 
for a couple of days. But I did get decently fed in the summer house during 
the day time. Usually there were other visitors or regular inhabitants to also 
work with Paul, and he would do this simultaneously. He led his usual life 
there, alternately proving, conjecturing, playing chess, ping pong, bridge, or 
walking to mountain tops. It was his habit to stop playing abruptly, when 
the rest of us were warming up to the game, and to return to work. In those 
days he went to bed around ten o-clock, but he woke up early, between four 
or five in the morning, so it was actually safer for me not to be living too 
close. 

There was a vague plan to write a book on set theory and I arrived with 
a number of old journals. One of them was the 1937 volume of the Comptes 
Rendus Varsowie containing a long paper of Tarski, “Ideale in Vollstandigen 
Mengenkorper” , in which we wanted to find something for the planned book. 
Erd6s volunteered to look it up. I had something else to do and I left him 
alone for a while. When I returned, he was excitedly reading. But not Tarski’s 
paper, it was a forgotten paper [Mil] of an American set theorist E.W. Miller, 
which was next to Tarski’s paper in the same volume. (Yes, the same as in 
Dushnik-Miller.) Miller proved that, for n finite, if F’ is a family of infinite 
sets and any two members of F intersect in at most n elements, then F' has 
property B. “Reading” meant reading the statements and trying to figure out 
the proofs. After a while, I gave up and started reading the paper in detail. 

The proof was by a cardinal induction on & = |F'|, the size of the family, 
and for a given «, the underlying set was split into the increasing continuous 
union of « smaller sets {Ag : a < K}, each Ag, being closed with respect to 
certain operations. In this case, for each n + 1 element set, there is at most 
one set containing it, and the elements of this (possibly non-existent) set were 
the values of these operations. Then the induction hypothesis was applied to 
the families F'|A,. This is called nowadays, the method of elementary chains. 
Miller actually proved that F has the stronger property B(< No), i.e. there 
is a set B which meets each element A of F in a finite set. 
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Paul’s only comment was: “You see there are still things we do not know,” 
and before we actually read all the details, he started to ask questions. What 
if the sets are only almost disjoint (have finite intersections)? There is a 
counter-example on the second page of Miller’s paper, and I tried to return 
to the details. “Yes” he said, “but we should then assume that the sets are 
bigger.” So, instead of collecting data for the book, we wrote a long paper. 

Let me state a special case of one of the main results: Assume GCH. If F 
is a family of very strongly almost disjoint sets of size No, i.e. |ANB| < Xo for 
AZABEF, then F has property B(< Xz). More importantly, if F consists 
of sets of size 1 just strongly almost disjoint, i.e. |AN B| < Xo for AA B 
in F, then F still has property B(< &2) provided |F| < Xu. 

The reason why the proof broke down for X41 was quite clear. In the 
generalization of Miller’s proof we had to use infinitary operations, and alas 
nXo is greater than X,, no matter what we assume. 

We both felt that this is a real hard-core problem and we tried to find 
other methods. In doing so we formulated the following statement (*): For 
a < Nu4i there is a partition of @ = Un<wSan into countably many pieces 
such that |San| < Nn and for any a < Nuqi with cf(a) = wy there is an 
increasing sequence (a, : v < w1) of ordinals a, — a such that for each 
n <w the sequences {Sq,,,:V <1} are increasing as well. 

Of course, we could not prove this, but we could deduce from it the 
theorem for |F'] = Xw41. 

Later a young German set theorist W. Donder pointed out that our 
statement is an easy corollary of Jensen’s Oy,, and as a corollary of this 
statement and some obvious generalizations, the theorem for families of sets 
of size &; is true in L. 

In 1986, in a paper with Juhdsz and Shelah [HJS], we proved that it is 
consistent, relative to a super compact cardinal, that there is a family of size 
Roti of strongly almost disjoint sets of size %1 not having property B and 
also GCH holds in the model. 

Paul was of course immediately asking if in Miller’s theorem B(< No) 
can be replaced by B(k) with some k < w. Let us consider families F 
of countably infinite sets, such that for any two A # B e€ F, |AN 
B| < n < w. First we proved that for countable families F, F has 
property B(n + 1) but not necessarily B(n). Then much to our surprise, 
we proved using GCH that, if |F] = %,, k < w then F must have 
property B((k + 1)n + 1) but not necessarily B((k + 1)n). The reason 
for the surprise was, that these were strong incompactness results saying 
that there is a family of size X,41 of countable sets not having property 
B((K + 1)n + 1) but every subfamily of size 8; has this property and such 
incompactness results were not then in the literature (but we already knew 
of the Hanf-Tarski result by the time we finished the paper). At the end 
of the paper we gave a long list of incompactness problems for Nj which 
were later solved by different authors. Eventually, Paul’s persistent interest 
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in these problems led to Shelah’s celebrated compactness theorem for singular 
cardinals [S3]. 

I just state here one of the problems, the fate of which I will describe in 
§10.9. Does there exist a graph on X2 vertices having uncountable chromatic 
number, such that all subgraphs of size X, are at most Xo chromatic? 

Finally, let me mention that due to the interest of Paul, property B had 
an even bigger career in finite combinatorics. But fortunately, this is not the 
subject of this note. 


11.7. Chromatic Number 


In our paper [42] we discovered r-shift graphs. Reference [42] is “Some 
remarks on set theory IX”. Its subject is a general problem involving reals, 
so I hope it fits into Komjath’s paper. But I must mention that a few years 
ago, Fremlin and Talagrand obtained some very interesting results solving 
most of the problems stated there [FT]. 

The vertices of the «, r-shift graph G(K,r), 2 <1 <w are the r-tuples of 
« or rather the increasing sequences {ag,...,@r—1},Q9 <... << Q,p_1 < K and 
we join {a9,@1,...,@,—1} and {a 1, a2,...,a,}. We proved, as a corollary of 
Ramsey’s theorem or the Erdés-Rado theorem, that these graphs have large 
chromatic number and that they do not contain odd circuits of length less 
than r+ 2. 

It was an early result of finite graph theory that there exist graphs having 
large chromatic number and not containing a K3 (see [46] for historical 
references). I think G(n, 2) is the simplest example of this and we were both 
surprised that this was not known earlier. Paul was always interested in this 
problem. He proved in 1959 using his probability method that for all k < w 
and r < w there are graphs of chromatic number > & and of girth > r 
(not containing circuits of length < r). He was always interested in infinitary 
generalizations and in [22] he proved with Rado that, for x > w there is a 
k3-free graph on « of chromatic number k. 

These results again suggested the wrong generalization, but this time 
we were not defeated. In [46] we proved that a graph not containing Cy (a 
circuit of length 4) has chromatic number at most No. In fact, we proved 
a much stronger result. We defined col(G) the coloring number of G as the 
smallest cardinal « such that the vertex set of G has a well ordering such 
that for each vertex + the number of edges having x as the larger element 
is smaller that «. This concept was later introduced in finite combinatorics 
under a different name as well (G is k-degenerate if col(G) < k +1) [Bo]. 
Obviously, chr(G) < col(G) and we proved that if G does not contain a 
complete bipartite graph Ky,.x,, for every k < w then col(G) < No. We used 
the cardinal induction method described in the previous section. Again, the 
problem arose, what can be said if only larger complete bipartite graphs are 
excluded? Let me again state a special case of our result. Assume GCH. If G 
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does not contain a Ky,.x, then chr(G) < Ne and if G does not contain a 
Kyo.x. then chr(G) < Xi provided |G| < Nw. 

The situation is analogous to the one described in the previous section. 
It is consistent that the second clause of the theorem is true for every G e.g. 
if V = L, and it is consistent (relative to a super compact cardinal) that the 
result strongly fails for X.,41 ie. there exists a graph G on X41 of chromatic 
number NX» not containing a Ky, x,. This was shown in our paper with Juhasz 
and Shelah mentioned in 10.6. The construction of this example from the one 
described there is a combinatorial argument, which uses that in the model 
we have many instances of > (the diamond principle). 

I have to mention that we also introduced generalized Specker graphs to 
show that for « > w there are graphs of size «, with chromatic number k 
having large odd girth. 

There are quite a few generalizations of our theorem for col(G) > Xo but 
I do not state these here, instead I offer the references [K 1, HK3] and [101]. 
Let me mention one typical Erdés question: Does chr(G) > Xp imply that G 
contains all large odd circuits, say of length 2k + 1 for k > ko for some ko. 
Note that this is a typical problem where it is the chromatic number that has 
to be large as col(K’y,,x,) = Xi. Later we proved this with Shelah in [79]. 

Rado asked if the de Bruijn-Erdés compactness theorem for finite 
chromatic number extends to finite coloring numbers. As the definition of 
the coloring number involves a well-ordering this can not be expected. Indeed 
we disproved it, but a surprising result of [46] is that still there is a uniform 
bound. We proved: If col(G’) < k(2<k <w) for every finite subgraph G’ of 
G then col(G) < 2k — 2, and there is a countable graph to show that this is 
best possible for each k. 

There is an important finite theorem hidden at the end of [46]. We proved 
there, using the probabilistic method, that for every r, s, k, there are r- 
uniform hypergraphs of chromatic number greater than k and girth greater 
than s. In fact, defined on some n-element set, they do not contain an 
independent set of size n1~¢ for some d > 0. This fit logically into the line of 
thought of [46] and it did not occur to us that no finite combinatorialist will 
look at, much less read, a 40 page paper full of alephs, to find an interesting 
probabilistic argument on the 35th page. 


11.8. Another Miss 


We first met Eric Milner in 1958 at the IMC meeting in Edinburgh. He was a 
former student of Rado and was working in Singapore. Rado interested him 
in partition problems and he settled one of their problems about countable 
ordinals [M1]. That was enough to induce Paul to visit and work with him in 
Singapore in 1960. He returned from there to Budapest with a new interesting 
problem which I solved and this began a long collaboration between the 
three of us. The Milners’ returned to England in 1961 and Eric joined Rado 


Paul Erdés’ Set Theory 401 


at Reading. On my way back from Berkeley to Budapest in 1965, I stayed in 
Reading for a month with them discussing a long half-finished manuscript. 
Although our long joint papers only appeared a few years later, in 1965 
we were already deeply involved in our joint work and we thought it would 
probably help if all three of us could be together at the same place and at the 
same time. So it was arranged that Eric should visit us during the summer of 
1965 to spend a week at the summer house of the Writer’s Union in Szigliget 
on Lake Balaton. Eric arrived with an interesting question about transversals, 
and as a consequence, instead of regularly working on manuscripts, we wrote 
another shorter paper [56] which became our first joint work to appear. As 
a side issue in that paper we proved the following theorem: Let \ > cf(A) = 
kK >w, and let A\g(a < &) be an increasing continuous sequence of cardinals 
cofinal in A, and assume that t® < X fort < x. If S is a stationary subset 
of & and F C MaesAa is an almost disjoint set of transversals (i.e. |{a ES: 
f(a) = 9(a)}| <Kforf #9 €F), then |F| <r. 

Eric made notes of our results and wrote it up and the pap er appeared in 
1968. We forgot the whole thing, and the paper seems to have gone unnoticed. 
Even in 1967 when we wrote the first problems paper with Paul, where our 
intention was to write down all our interesting problems, we omitted any 
mention of this. However, it seems we were not the only blind ones. During 
the summer of 1971 Adrian Matthias organized a large conference on set 
theory in Cambridge, England. Karel Prikry was one of the invited speakers 
and he gave a talk on a generalization of Jensen’s work on Kurepa families. 
He discovered the following result: Under the assumptions of our theorem, if 
H C P(A) is a Kurepa family in the sense that |H|Aq| < Aa forae S (Sa 
stationary subset of Kk), then |H <r. (H|Ayg = {HN Av: HE H}.) 

He told me this result the day before his lecture and it sounded vaguely 
familiar. But it took me the whole day to realize that this was just our earlier 
theorem applied to the sets H|A, in place of Aq. I managed to get a copy 
of our paper to give to Prikry before the lecture. Now there were about 
one hundred set-theorists in attendance, including all the leading ones, when 
Karel stated our result in a totally digestible form. But nobody asked, what 
happens if we replaced Aq by AZ? I suppose the psychological barrier was too 
strong. In 1974, just before the ICM in Vancouver, I was visiting Eric again 
in Calgary (he moved there in 1967), when I received a preprint of Silver ’s 
ingenious discovery that: if A > cf(A) > w and if 27 = rt on a stationary 
set of cardinals T < X, then 2* = d+. At the same time I received a preprint 
from Prikry giving a combinatorial proof of Silver ’s result. Prikry told me 
that the instant he saw Silver ’s manuscript it dawned on him that the only 
thing needed was to lift our old result with At in place of \q. Of course 
this requires a non-trivial argument. Baumgartner and Jensen also found 
elementary proofs of Silver ’s result without remembering our theorem. But 
the real miss, and so uncharacteristic of Paul, was not to have asked the 
question! 
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11.9. Incompactness for the Chromatic Number 


In 1966, assuming CH, we solved the problem on chromatic numbers stated 
in $10.6. We proved in [54] that there is a graph of chromatic number at 
least N; on (2%°)+ vertices all of whose subgraphs of cardinality at most 
20 have chromatic number at most No. This also comes with a story and 
some advice. During a working session at Paul’s apartment, we were talking 
about something totally unrelated to chromatic number and compactness. In 
the middle of an attempted proof, we found that the pairs of R are colored 
with countably many colors and our proof would be finished if there was a 
monochromatic increasing path of length 2, [P, i.e. a triple x; < x2 < 23 
with {1,22} and {x2, x3} having the same color. Unfortunately there was 
not, and I tried to get another proof. But Paul started to insist that we should 
know for what order types 6, 


6 -» (IP2)? 


holds. We parted unsuccessful in both attempts. But on the way home, I 
could not help thinking about his question. I remembered an old idea of 
Sierpinski which easily implied that there is a ~ for every @ of cardinality 
|0| < 280. Then I saw in a flash that this just says that all subgraphs of size 
2% of our shift graph G((2%°)+, 2) have chromatic number < No. As this was 
a 100 dollar problem, I immediately called Paul when I arrived home. (I think 
eventually I got only $50 for it, but with some reason). The advice is this: 
just answer his questions, you have time later to ponder if it is important or 
not. 

This was the status of the problem when we published it in [67]. Let me 
tell some later developments. First Jim Baumgartner proved with a forcing 
argument that it is consistent that GCH holds and there is an X2-chromatic 
graph on Xo vertices, such that the chromatic number of every subgraph of 
size X, is at most No [B]]. 

M. Foreman and R. Laver proved that: it is consistent with GCH relative 
to a large cardinal that every graph on Xo, all of whose &, subgraphs are 
No-chromatic is at most X%1-chromatic [FL]. 

Finally, Shelah proved, after improving his own results several times, that 
it is consistent (true in L) that: for every regular non-weakly compact k, there 
is a K-chromatic graph on « all of whose subgraphs of size less that k are No- 
chromatic [S4]. 

We also invented an interesting graph in [54]. Let C(w2,w) be a graph 
whose vertices are the elements of “?w, i.e., wa-sequence of integers and 
we join two sequences if they are eventually different. We proved, and 
these are obvious facts, that every subgraph of size Xi of C(we,w) has 
chromatic number < Xo, and moreover, that every graph of size Xz having this 
property embeds into C'(w2,w). However, the chromatic number of C(we, w) 
in ZFC is a mystery. Our result implies that CH=> chr(C(w2,w)) < Xi 
and Péter Komjadth proved this from the weaker assumption 2° < Ng, and 
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he proved it consistent with GCH that chr(C(w2,w)) = N3 [K 2]. On the 
other hand Foreman proved it is consistent relative to a large cardinal that 
chr(C(w2,w)) < Xi [Fo]. It seems to be out of the question with the present 
methods to prove that our graph is consistently No-chromatic. 


11.10. Decomposition of Graphs 


I just want to mention our paper [53] which appeared in 1967 but was 
written about 2 years earlier. In this paper we raised problems of the following 
type. Does there exist graphs G not containing a K), for some cardinal 4, 
such that for every vertex partition or edge partition with few colors say a 
monochromatic K, appears. In present notation: For what A, 7, y is there a 
K)-free graph G such that 


G — (K,), or G > (K,)2 


holds? We had some results but I just want to restate two edge partition 
problems from the paper, one of them finite. 

Does there exist a finite Ky-free G with G > (K3)3? This was solved 
by Folkmann [Fol] affirmatively, but the question became one of the starting 
points of structural Ramsey theory (see §15). 

The infinitary problem is the following. Does there exist a K4-free graph 
G of cardinality (2%°)+ such that G > (K’3)?, holds? As far as I remember, 
this was the last set theory problem Paul offered a prize for (it was worth 
$250.) Shelah later proved this to be consistent, but I will speak about the 
status of this kind of problem in a more general context in §15. 


12. A-Systems and More Set Mappings 


A-Systems were introduced in a paper of Erdés and Rado which appeared 
in 1960 [26]. A family F of sets is a A-system if there is a set D, the kernel 
of F’, such that AN B = D for all A # B € F. The paper set the task to 
determine A(«, A) = 6 the smallest cardinal for which every family F’, of sets 
of size & and cardinality 6 contains a A-system of size A > 3. As it is well 
known, for finite «, the problem is still unsolved. A $1,000 reward is offered 
by Paul for the proof or disproof of the conjecture that for some c > 0 


A(kK,3) <c* fork <w 


However, for & > w, Erdés and Rado settled the problem completely. 
Although some of the details were only cleared up in their second paper 
[60] on the subject, the main upper bounds were already obtained in [26]. 
One of the main results says that if x < 6 = cf(d), |F| = 6 > w and 6 is 
inaccessible from kK, t.e. 0% <6 fora <6 then F' contains a A-system of size 
6. This is probably the most frequently used theorem of set theory, since it is 
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the simplest tool to prove that certain partially ordered sets satisfy certain 
chain conditions. If (P, x) is a partially ordered set p,q € P are incompatible 
if there is no r € P with r X p,q and P satisfies the K-chain condition if every 
subset of pairwise incompatible elements has cardinality < «. It was already 
an important element of Cohen’s proof of the independence of the continuum 
hypothesis, that finite 0, 1 sequences from any index set, ordered by reverse 
inclusion satisfy the No-chain condition. Cohen and his early followers did not 
know the Erdés-Rado theorem and they proved it for the special cases they 
needed. But soon it was discovered by logicians, and it is invoked almost any 
time forcing is used. 

There is another theorem of Erdés and Specker [30], I should have 
mentioned in §5, which is used almost as often in forcing arguments to 
establish chain conditions as the A-system theorem. Assume f : & > P(k) 
is an ordinary set mapping. In §5 we saw that, if If |f(x)| < 7 < « for some 
cardinal tT < « then there is a free set of size & and « is the union of 7 free 
sets. Now if « is a successor cardinal AT then the assumption that the type 
tp(f(x)) < € < «+ for some fixed € < \* is weaker than the assumption that 
| f(x)| <7 for some cardinal t < « but by the Erdés-Specker theorem, this 
still implies the existence of a free set of size «; however, Fodor’s theorem 
does not apply since the graph induced by f can be At-chromatic. Most of 
the time that we want to construct or force an object on At such that each 
subset of size A+ contains a subset of size At of certain kind, but the whole 
set is not the union of X sets of this kind, then the Erdés-Specker result is 
the first thing to remember. 


13. The Unsolved Problems in Set Theory [67] 


In 1967 the first major post Cohen conference was held at UCLA. By that 
time, Cohen’s method was generally known and developed, and the aim of 
the conference was to bring together all experts of set theory and to collect 
and make public all the fantastic new results available. We were both invited, 
Paul was there, but I could not make it. (It was the only time I did not get 
a passport from the Hungarian authorities.) The organizers convinced Paul 
that instead of mentioning a few interesting problems as the spirit moves him, 
he should write up all the difficult problems he came across during his work 
in combinatorial style set theory. He immediately promised that we will do it 
in a joint paper. This time we worked hard and fast. A mimeographed version 
of the manuscript containing 82 problems (or groups of problems really) was 
ready in the same year and we sent a copy to everyone we knew and who 
we thought would be interested. It included all the problems I mentioned in 
the previous sections and quite a few more. A large number of (then) young 
mathematicians started to work on these, and produced solutions either by 
applying the newly developed methods of independence proofs or simply 
divising new combinatorial methods. The paper only appeared 4 years later 
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in 1971, and by that time the status of most of the problems had changed. 
We tried to keep the manuscript up to date by adding remarks, but in 1971 
we decided to write a second problem paper [81] which contained the status 
of the problems up until that time. 

It would clearly be impossible to write a similar survey today. In the 
previous sections, I tried to show on selected topics how the Erdés problems 
generated new questions and results and how they became integral parts of 
modern set theory, and how many of them are still alive. In this section I can 
only mention the status of a few more which I omitted earlier. 

I did not finish the story of set mappings of type < w. Shortly after our 
problem paper was distributed Jim Baumgartner proved in his thesis [B2] 
that if V = L then Free(k, 2, < w, Xo) is equivalent to K — (No)5“ but on the 
other hand, it is still open if it is consistent relative to a large cardinal that 
Free(Ny,2,< w,No) holds, or more strongly, there is no Jonsson algebra on 
X.,. As I already mentioned, Free(X.,, 2, < w, No) was our first joint problem. 
We already suspected at Kalmar’s supper that it will be hard, but probably 
not quite as hard as it turned out to be. 

I am afraid I have mentioned too many problems which led to indepen- 
dence results, so here is a difficult theorem of Shelah and Stanley solving one 
of our problems in ZFC: 


(QPP) a > ((2%*)+w,n)? 


for n < w [SS2]. It is another matter that they also proved w3.w,; — 
(w3.w1,3)? to be independent of ZFC and GCH. 

Erdés proved with Alaoglu in 1950 in [5] that if « is smaller than the 
first weakly inaccessible cardinal greater than No, then one can not have 
No a-additive 0, 1 measures so that every subset of S is measurable with 
respect to one of them. Erdos attributes the question to Stanislaw Ulam but 
he got the first result. We asked if No can be replaced by Ny here? Prikry 
proved it to be consistent, but this question became the forerunner of so 
many questions in the theory of large cardinals that I do not dare to write 
about later developments in detail. 

Instead, here are some evergreen problems from the theory of ordinary 
partition relations for ordinals. 


1. w’ — (w”,3)? was proved by C.C. Chang [C] and w* > (w*,n)?n < w 
was proved by E.C. Milner [M2] and independently by Jean Larson [Lar]. 
But w” > (w**,3)? or w** — (w®*, 3)? seem to be as hard as ever. Here 
of course a? means ordinal exponentiation. 

. Does there exist an a with a — (a,3)? such that a + (a, 4)?? 

3. I proved with Jim Baumgartner in 1970 [BH] that ® + (w)} > Va < 
wivk < wh — (a)?%. But for exponents >2 very little is known. For 
example, w; — (a,4)° is still open for a < w . The world record is 
presently held by Milner and Prikry; they proved this for a < w.2+1. See 
[MP]. 


i) 


406 Andras Hajnal 


4. Is w2 + (a)3 for a < we consistent with GCH? I proved the consistency of 
Ww. + (w,-+w)3 and it follows from the existence of Laver’s ideal mentioned 
in §10.5 that wa — (w.2)3. 

5. It follows from a recent result of Baumgartner, myself and Todoréevié 
[BHT] that GCH=> ws + (we + €)? for € < wi and k < w but w3 > 
(w2 +2)? is still open. See [BHT] for many new problems arising from our 
results. 


14. Paradoxical Decompositions 


Erdés has 12 major joint papers with Eric Milner, nine of those were written 
by the three of us. These are from a later period so the results and problems 
are more technical than the ones I described earlier, it is out of question to 
give a list of them. I want to speak about one idea which features in quite a 
few of them. 

It was always clear that Ramsey’s theorem is a generalization of the 
pigeonhole principle of Dedekind. When partition relations K — (A,)" 2 
were formally introduced, it became apparent that the pigeonhole principle 
is just a partition relation for cardinals with exponent r = 1. For example, 
nk +1 — (n+ 1); for the finite case with k boxes, and No > (No); for 
k <w, and more generally, « — («)} for A < cf(K), K > w. It was discovered 
by Milner and Rado in [MR] which appeared in 1965 that the pigeonhole 
principle does not work the same way for ordinals. They proved that for any 
K2W 


(i) E+ (K")he, if € < Kt and as a corollary of this € + (K”)} for € < Kt. 


Here again a? denotes ordinal exponentiation. This phenomena, often called 
the Milner-Rado paradox, has to be kept in mind, just because it is so 
contrary to one’s first intuition. When partition relations proliferated it 
was discovered that this (as almost anything) can be written as a polarized 
partition relation: 


x Ww 1 w oe 
(ii) (2) » (4) fore = Ke 


and also as a square bracket relation: 


(iii) € + [KY], ex, for € < xt. In [63] we have investigated the polarized 
partition relation 


K tay 
¢ * afer for K =w and k = 41, € < wo. 


We gave a complete discussion, relying heavily on the form (iii) of the 
paradox in the case kK = wy}, i.e. 


(iv) € + [wi ]ko ex, for € < we. 
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When we tried to lift our results to higher cardinals we realized that we 
would need to generalize (iv) to 


(v)E~ [Ko ]Ra to for € < w3. 


We already discovered in 1967, that this will not be possible in ZFC, 
but we only wrote down the results which we called the N2-phenomenon in 
our 1978 paper [93] relying heavily on other people’s results. See [93] for 
references. 

Since this is not so well known, I will write down the N2-phenomenon as 
it relates to (v). 


A.l € » [k9"]k, 1», holds for € < w5? 

A.2 If 281 = No then for some £9 < w3, & > [wi Ties No 

A.3 It is consistent with 2°+ = Nz that € > [ws Thy to holds for € < wg 

AA w5? + [WS] x, and wo? » [we’]k. x, are both consistent with ZFC 
and GCH. (The ~ holds e.g. in ZL while the — follows from Chang’s 
conjecture.) 


All this happens because a counterexample establishing the ~ is really 
a sequence {A§ : a < wi} C € such that the order type tp(Uge,,4s) < 


ger for a function fe : w; — w; and, for ¢ < €, f¢ must be smaller than 


fe in some well known ordering of these functions. In fact, this was the reason 
why we asked all the problems 19A—19E in the unsolved problems paper, 
about the relation of the transversal hypothesis and the Kurepa hypothesis. 

Problem 19D was slightly out of the line there. Typically, Paul asked 
something that was quite new: are there 2®: almost disjoint, stationary 
subsets of w,? It is easy to see the consistency of a ‘yes’ answer, it is true 
e.g. in L, however the consistency of a ‘no’ answer with CH is not completely 
proved. Foreman, Magidor and Shelah proved in [FMS] that ‘no’ follows from 
a consistent set-theoretical principle called Martin’s Maximum (MM), but 
MM implies that 28° = Ny. They also proved it consistent with CH that 
there is a stationary subset of w; on which the nonstationary ideal is No- 
saturated. All these very difficult consistency proofs of course are relative to 
the existence of some large cardinals. 


15. A Mistake and Its Consequences 


In $12 of our paper [46] about chromatic numbers, we claimed a false theorem. 
I just state a special case. Let H = (h, H) be a 3-uniform hypergraph (i.e. 
H c [h]?) such that every pair e € [h]? is in at most countably many elements 
of H. Then we claimed that the chromatic number of H is at most No. 

As we know now, this is true if |h| < &; and false for a triple system 
of cardinality (2*°)+. Now I have to disclose a not so surprising secret. Paul 
actually wrote up some of our joint papers, but these were the short ones. For 
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the long ones it was my job to prepare the manuscript, but we always read the 
manuscript and even the proof sheets together. The trouble was that he often 
got bored with mechanical work like this, and he made up new conjectures 
and theorems and insisted that we should include them by adding remarks— 
even to the galley proofs. Taking the responsibility, I think I was the one who 
overlooked that the cardinal induction method breaks down from Nj to No 
in this case. Anyway, if the theorem was really true, the whole structure of 
the paper should have been changed but fortunately we did not have time 
for that. 

As usual, I forgot the theorem, but Paul did not. I got a phone call from 
him from abroad about 4 years after the paper had appeared. He was trying to 
tell the proof of it to Bruce Rothschild, and got stuck. They soon discovered 
a counter-example. Let (2%°)+ = x, h = [«]?, H = {{{a, B}, {8, y}, fa, y}} : 
a<8<7< «}, Clearly, any two elements of H have at most one element 
in common, and the chromatic number is at least X; by the Erddés-Rado 
theorem. We wrote a triple paper [76] about it. However, Paul got interested 
in this question: what kind of finite triple systems must appear in an X4- 
chromatic triple system? I think the first question was the 6/3, i.e., are there 
three triples with empty intersection such that each pair has exactly one 
point in common? (This question for triple systems made quite a splash in 
finite combinatorics as well.) Fred Galvin came up with a negative answer. 
Later Fred spent the academic year 1972-1973 in Budapest, and the three of 
us started to work on this problem and we asked the same question for triple 
systems not containing large independent sets. Unfortunately, we did not 
find a general answer, maybe there isn’t one, every time that we constructed 
a large chromatic system avoiding concrete finite systems, Paul ingeniously 
invented new ones for which the construction did not work. The motivation 
behind this was the following. Clearly there is a cardinal « with the following 
property. If for a finite triple system H there is a (> No)-chromatic triple 
system Ko not containing H, then there is one of cardinality at most k. 
Cardinals which satisfy a condition like this are quite often impossible to 
determine, and such was the case with this problem. For two triples with a 
common edge the number we found is (2%°)+. With GCH all our examples had 
cardinality Nj associated with them. We ended up with a concisely written 
paper almost 90 pages long [85] containing some really good theorems, but 
which remained relatively unknown. Again, it is not possible to give a list 
of the results, but I do want to mention one concept and problem from the 
paper that I really like. 

We constructed large chromatic r-uniform hypergraphs by induction on 
r, and to support the induction from r to r+ 1, we needed the r-tuple system 
H, to have a stronger property than chr(H) > No. 

Let H, = (h, H,)(v < y), be a system of r-uniform hypergraphs on the 
same vertex set h. The system has simultaneous chromatic number > No if, 
for every partition of the vertex set h = U,,<,, hn into No parts, there is an 
nm <w such that h,, contains “edges” from each H, for v < y. 


Paul Erdés’ Set Theory 409 


We say that a (> No)-chromatic H = (h, H) splits to 6 parts, if there is 
a disjoint partition H = U, 2; Hy so that the system H, = (h, H,)(v < 6) 
has simultaneous chromatic number > Ng. We proved that quite a few known 
(> No)-chromatic graphs split to XN; parts and these served as a basis of our 
induction process. 

In those days, before Todoréevi¢é’s result, we only knew with CH that Ky, 
splits to Ny-parts. Still, as we did not find anything that does not split, we 
asked the question: Is it true that every (> No)-chromatic graph splits to two 
(or X,) parts? 

This problem as it stands is still unsolved. With Péter Komjath I have 
some unpublished partial results. Here are two of them. 


(1) It is consistent that every X,-chromatic graph splits into ®1 
parts. 

(2) It is consistent relative to a measurable cardinal, that there is a (> No)- 
chromatic graph which does not split into %y parts. (We do not know this 
for two parts.) 


16. Structural Ramsey Theory 


As I already mentioned in §10.10, we asked the first questions of the following 
type: Does there exist a K4-free graph G such that G — (K3)3. 

The following type of generalization appeared first in Deuber’s paper 
[D]. Let G, H be graphs; H embeds into G if G has an induced subgraph 
isomorphic to H. With present day partition calculus notation, we say G — 
(H)2 , if for arbitrary colorings k: G > «,@: [g]? \G— X of the edges of 
G with « colors and non-edges with A-colors, there is an induced subgraph 
H' C G isomorphic to H such that k and @ are constant on the edges and on 
the non-edges of H’ respectively. 

Deuber proved that for all finite H and k < w, there is a finite G with 
G > (H)?, and the combination of the two types of questions Paul 
raised became the starting points of the NeSetiil-R6dl type structural Ramsey 
theory. 

With Erdés and Posa we proved the first infinitary result of this kind [88]. 
The paper appeared in the volume of the Keszthely conference held for Paul’s 
60th birthday in 1973, and this volume contains the first Nésetiil Rodl paper 
on the subject. The finitary theory developed very fast. The problem was 
generalized for coloring of substructures of a fixed kind instead of coloring 
pairs, but fortunately I do not have to give an account of this. I just want 
to say that this was not done in the infinitary case because here some basic 
problems are still open. 

In the paper with Erdés and Pésa we proved that for every countable H 
and k <w there is a G (\|G| = 2°) such that G > (1); ,, and asked if this 
holds true for countably many colors, or for larger H. 
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We discovered a decade later with Péter Komjath that it 7s consistent to 
have |H| =w, and G & (H)3, for every G [HK2] and Shelah proved that it 
is consistent that for all H andy there is aG with G > (H )> 4 [S6]. 

In 1989, [H 2] I proved in ZFC that for all finite H and aroiiears 7 there 
is aG with G~ (H Jen but the problem of countable H and countable ¥ is 
open (though the — is consistent by Shelah’s result). The answer may turn 
out to be to Paul’s liking (a theorem in ZFC) but I am sure it will be very 
difficult. 

Shelah generalized his consistency results for K,-free H as well, but at 
this point I feel I have to stop and refer the reader to a recent survey paper 
of mine on this subject [H 3]. 


17. Applications of Partition Relations in Set 
Theoretical Topology 


In the last 30 years, set theoretical topology became a major area of research 
as shown e.g. in the Handbook of Set Theoretical Topology. The reason for 
this is that the new methods of set theory (forcing, large cardinals) made it 
possible to study topological spaces for what they are, namely set theoretical 
objects. The point I want to make is that, although Erdés did not take an 
active part in most of this, combinatorial set theory which he created is one 
of the major tools in this development. 

This happens not just through the applications of positive theorems. 
There are of course some famous ones. Being closest to the fire, with Istvan 
Juhdsz we showed, for example, as a consequence of (2?°°)+ —> (1)? that, 
every Hausdorff space of cardinality (22"° )* has discrete subspaces of size Xj. 
Also, as a consequence of the canonization theorem of §10 that, the spread 
(the supremum of the sizes of discrete subspaces) is attained in a Hausdorff 
space if this supremum is a singular strong limit cardinal. 

More importantly, there are literally dozens and dozens of examples 
obtained as strengthenings of negative partition relations which would never 
have turned up in their present form without a detailed analysis of these 
relations. Let me try to make this clear with an example. I already mentioned 
Prikry ’s consistency proof of 


<> F 
Ni Ni), 


To state it in “human language” (but already a little twisted for my purposes), 
it means that there is a sequence {fg : @ < w2} C¥! 2 such that for all 
countable I € [w2|No there is a v(I) < w, such that for v(1) < v < wy there 
are Q9,a1 € I with fo,(v) =0 and fa, (v) = 1. 

When in [HJ] with Juhasz we discovered HFD’s (hereditarily finally dense 
sets) and proved the consistency of the existence of a hereditarily separable 
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space of power N2 (assuming 2*° = X,) we only had to change the last clause 
of the above statement: there is a v(I) < w1 such that for every finite set F 
with v(I) < F < w, and every 0, -function ¢ defined on F there is an a € I 
such that for ally € F, fo(v) = e(v). And now { fa : a < we} is a hereditarily 
separable subspace of cardinality N2 of D(2)”:. 


18. A Final Apology 


I feel that I should stop at this point. One reason is that this is the 100th 
page of my handwritten manuscript, but there are other reasons. Paul has 
continued to work on set theory, stating new and old problems in the 
numerous problem papers he published. Our last major set theory paper 
with Jean Larson [109] appeared in 1993. It would not really be appropriate 
for me to speculate on the reactions that these latest problems may provoke, 
for we lack the perspective. It is also true, that his interest in set theory is 
slightly diminished, he does not like the technical problems which already in 
the assumptions involve consistency results. But he triumphantly continues 
to carry the flag of Georg Cantor. 

I also have some doubts about my manuscript. It is as if I have been 
trying to sketch a rain forest, but with only enough time and ability to draw 
the trunks of what I thought to be the largest trees. Paul’s real strength is 
in the great variety of those hundreds of small questions which he has asked 
that have given some real insights into so many different topics. I can only 
admire his inventiveness and thank him for everything he has given us. 

Finally, I also wish to thank our old friend Eric Milner for helping me to 
prepare this paper. 
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1. Introduction 


In this Chapter we consider P. Erdés’ research on what can be called as 
the borderlines of set theory with some of the more classical branches of 
mathematics as geometry and real analysis. His continuing interest in these 
topics arose from the world view in which the prime examples of sets are 
those which are subsets of some Euclidean spaces. ‘Abstract’ sets of arbitrary 
cardinality are of course equally existing. Paul only uses his favorite game 
for inventing new problems; having solved some problems find new ones by 
adding and/or deleting some structure on the sets currently under research. 
A good example is the one about set mappings. This topic was initiated by 
P. Turan who asked if a finite set f(x) is associated to every point x of the 
real line does there necessarily exist an infinite free set, ie., when xz ¢ f(y) 
holds for any two distinct elements. Clearly the underlying structure has 
nothing to do with the question and eventually a nice theory emerged which 
culminated in the results of Erdés, G. Fodor, and A. Hajnal. But Paul and 
his collaborators kept returning to the original setup when the condition is 
e.g. changed to: let f(a) be nowhere dense, etc. Several nice and hard results 
have recently been proved. (See Sect. 8 in this Chapter.) 

But this is not Paul’s secret yet. He has the gift to ask the right questions 
and-always-ask the right person. 


2. Sierpiniski’s Decomposition 


One direction of theorems and problems originated from W. Sierpiniski’s 
famous decomposition theorem [34]. This paradoxical statement says that 
CH (the continuum hypothesis) holds if and only if R* can be decomposed 
as AUB where A, B have countable intersection with every horizontal, resp. 
vertical line. (See [14] for a lucid description of Paul Erdés’ astonishment 
when first hearing on this result.) This shows that the generalization of 
Fubini’s theorem { [ f(z, y)dxdy = f { f(x,y) dydz may be false if we only 
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assume the existence of the two double integrals and not the measurability of 
f (take the characteristic function of A). H. Friedman showed that it can also 
be true (if one adds Ng random reals) (see [21]). Later, C. Freiling [20] proved 
that if there is a counter-example to this statement then necessarily there is 
one of the Sierpiniski type, i.e., then R? = AUB such that the intersection 
of A (or B) with every horizontal (vertical) line is a measure zero set. See 
[36] which is a very thorough survey paper on this result and several variants. 
Erdés [11] generalizes this as follows. Suppose the lines of R? are decomposed 
into two classes (and CH holds, of course). Then a decomposition as above 
exists with AM @ countable if @ is in the first class, and BN @ is countable if 
£ is in the second class. 


3. Linear Equations 


An old result of R. Rado states that every R” in fact every vector space 
over Q, the rationals, is the union of countably many pieces none containing a 
3-element arithmetic progression. This observation initiated several different 
research areas. In [18] Erdés and Kakutani showed that every vector space of 
cardinal at most &; is the union of countably many bases and this is sharp. 
Various other results hold if one want to cover a linear space with countably 
many sets omitting the solutions of some linear equations (and not all as in 
the above case) [27]. 

One way of proving the above mentioned Erdés-Kakutani result is via 
a combinatorial lemma of Erdés and Hajnal which also gives that if the 
continuum hypothesis fails and R is decomposed into countably many pieces 
then one of them contains a,a+2,b,b+< for some a,b,xz, x 4 0. Erdés then 
asked if the following complementary result holds. If CH holds then R” can 
be written as the union of countably many sets none containing the same 
distance twice. For n = 1 this is a very special case of the quoted Erddés- 
Kakutani result. For n = 2 this was established by R. O. Davies [7] and 
finally for the general case by K. Kunen [29]. 


4. Euclidean Ramsey Theory 


Perhaps the central topic in our discussion is the one that may be called 
Euclidean Ramsey theory. This deals with decompositions of R” where a 
configuration similar to a given one is excluded. Though in the finite version, 
i.e., when the number of classes is finite, there are several positive results 
(given by P. Erdés, R.L. Graham, P. Frankl, V. Rédl, and others), the infinite 
case abounds in counterexamples. 

To begin with, if the excluded configuration is infinite then there is always 
a decomposition of R” into two classes with no copy of the configuration in 
one class. This nice observation which uses an earlier result of P. Erdés and 
A. Hajnal is well hidden in [15]. As for the case of of finite configurations 
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J. Ceder already in 1969 proved that R® can be colored with countably 
many colors without a monocolored equilateral triangle [4]. The tricky proof 
uses that C, the complex plane is a vector space over the countable field 
Q(/—3). In this vector space any node of an equilateral triangle is a convex 
linear combination of the other two. The proof now goes like R. Rado’s 
quoted proof. It “defines” the coloring of a vector from its coordinates over a 
Hamel-basis. 

P. Erdés asked if Ceder’s result holds for R”, as well. Obviously, the above 
proof won’t generalize. The more classic inductive argument, in the spirit of 
Davies’ results, makes possible to show that a similar statement holds for 
regular tetrahedra in R®, see [23]. For regular simplices in R” this was shown 
by J. Schmerl [31]. Then he extended his result to equilateral triangles [32]. 
Finally, adding some really nice and deep arguments from higher dimensional 
calculus and logic J. Schmerl was able to show that P. Erdés’s conjecture 
is true, there is a countable decomposition of every R” with no isosceles 
triangles in one part [33]. 

One might think that excluding right triangles is similar to the case of the 
isosceles triangles. In fact, it was shown in [19] that CH is actually equivalent 
to the existence of a countable decomposition of R* not containing the three 
nodes of of a right triangle in one piece. Interestingly enough the ‘only if? 
part is an easy corollary of an above quoted Erdés-Hajnal result, the ‘if’ part 
is rather complicated. 

Another variant, also raised by P. Erdés, if one can exclude those triangles 
with rational, nonzero areas. Clearly, we cannot multicolor triangles with zero 
area, and the statement is obvious for one given value of area. In unpublished 
work K. Kunen showed that if CH holds then there is a decomposition of R? 
omitting rational areas as asked. In [28] we extended this to a broad class of 
configurations namely we showed that there is, under CH, a count able decom- 
position of R” with no different points G,...@%, satisfying any p(@,...G@) = 
0 where p is a polynomial with rational coefficients such that p(@q, ...a@) 4 0 
holds for every @ € R”. (Again, this is trivial for one such polynomial.) This 
has recently been extended to ZFC proofs in [33] by J. Schmerl. 

Paul Erdos asked if the following asymmetric variant of the above quoted 
result of [15] is true. R can be decomposed into two pieces, A and B such 
that A omits 3-element while B omits infinite arithmetic progressions. This 
was proved (for arbitrary vector spaces) by J. E. Baumgartner [3] using ideas 
somewhat similar to the proofs of the results of Rado, Ceder, and Schmerl. 


5. The Hilbert Space 


The situation changes radically if we replace R” with the Hilbert space 
é° of infinite real vectors (29,2 + 1,...) with }> a? finite. An observation 
due to Erdés, Kakutani, Oxtoby, L.M. Kelly, Nordhaus, and possibly many 
others is that in this case there are continuum many points with pairwise 
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rational distance. As it is easy not to find a proof I sketch one. Work in the 
Hilbert space where an orthonormal basis is {6,} where s can be any finite 0-1 
sequence. To every infinite 0-1 sequence z associate a(z) = )) Apbz)n where z|n 
denotes the string of the first n terms of z and A, = f/3-2-@+) TE z # z' first 
differ at the (n + 1)-st position then (a(z) — a(z’))2 = S{\? si >n=4-"} 
so the distance between a(z) and a(z’) is 2~”. It is easy to see that every 
triangle in this construction is isosceles. I don’t know if there is a similar 
(or any) construction of continuum many points such that all three-element 
subsets form a triangle with nonzero rational area. (See some problems and 
several remarks on this topic in [12].) 


6. Games 


Yet another variant of these problems is to decompose the spaces by games. 
Let V be the «-dimensional space over the rationals and let two players 
alternatively choose previously unchosen vectors in a transfinite series of 
steps. At limit stages the first player chooses and he has to cover an arithmetic 
progress ion as long as possible. Erdés and Hajnal proved that the second 
player can always prevent her opponent from selecting an infinite arithmetic 
progression. If the second player is allowed to select No vectors she can prevent 
the first player from selecting a 3-element (4-element) AP if & < Ni(K < No) 
but the first player can always cover a 3-element arithmetic progression if 
kK > No. This was generalized by Fred Galvin and Zsiga Nagy who showed 
that the first player can cover an (n + 2)-element AP for k = X, but his 
opponent can prevent him from selecting longer. 


7. Large Subsets 


Another type of problems asked repeatedly by P. Erddés is the following. Let X 
be a subset of the n-dimensional Euclidean space with some infinite (usually 
uncountable) cardinal « Given a property P can one always find a subset 
Y C X of cardinal « with property P. Already in [9] (see also [13, 5]) Erdés 
proves this if P is the property that no distance occurs twice. Notice that 
this is a property depending on finite sets. In [9] it is also proved that if « is 
regular then there is a Y such that all r < n-dimensional simplices formed 
by r +1 elements of Y are of different area (but this fails for e.g., Xu). 


8. Set Mappings 


As I already sketched in the introduction the theory of set mappings 
originated from a question of P. Turan concerning a problem in approximation 
theory. As usual, a set mapping is any function f on a set X such that f(z) 
is a subset of X excluding x. A subset Y C X is free ifa ¢ f(y) forz,yEY. 
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Turan inquired about free sets if X = R and f(a) is always finite. If one 
allows f(x) to be countable then there may be no 2-element free subset if 
|X| < Ni, a moment’s reflection shows that this is just a reformulation of 
Sierpiniski’s statement about the paradoxical decomposition of R?. P. Erdés 
proved [10] that if f(a) is nowhere dense for « € R then there is always 
an infinite free set. Bagemihl extended this to finding a dense free set [2]. 
Hechler proved [22] that under CH there is a set mapping f for which an 
uncountable free set does not exist and f(a) is an w-sequence converging to 
x! Uri Abraham observed that this may not occur if MA,,, holds and the 
existence of an uncountable free set if f(a) is assumed to be nowhere dense 
is consistent with and independent of MA,,, (see [1]). 

C. Freiling investigated what happens if a set mapping is defined on a 
set of size Ny. One of his results can be formulated as follows. If A C R 
has cardinal < Nj and f is a set mapping on A with no free subsets of 
size 2 such that f(a) is always of first category then either A is the union 
of & first category sets or every subset of A of cardinal < |A| is of first 
category. It is easy to see that in either case there is a well order < of A 
in which every proper initial segment is first category. In recent work the 
author (of this paper) proved that it is consistent that there is a set AC R 
with |A| = 83 with a set mapping as above but with no well order with first 
category segments [28]. 

In [16] Erdés and Hajnal proved that if f is a set mapping on R such 
that f(z) is always of measure zero and not everywhere dense then there is 
a free pair but not necessarily a free triplet. They also proved that if f(a) is 
bounded and of outer measure at most 1 then for every k < w there is a free 
set of size k and asked if an infinite free set exists. This was finally proved 
in [80]. 
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Summary. We investigate the property of certain well-founded orderings to have 
a chain of maximal ordinal length. We show that Heyting algebras and countable 
modular lattices have this property, but we also present an example of a “well- 
behaved” lattice which does not have it. We prove a general necessary and sufficient 
condition for modular lattices to have the property in a hereditary form. This 
hereditary form is called order-perfectness, being analogous to perfectness of finite 
graphs. Certain well-known theorems of D.H.J. de Jongh, R. Parikh, D. Schmidt, 
E.C. Milner, N. Sauer, and N. Zaguia turn out to be order-perfectness results. 


1. Introduction 


In every ordering of a set, there are maximal chains with respect to inclusion. 
In well-founded orderings, chains have ordinal type. However, different 
maximal chains may have different ordinal types and a question arises as to 
whether or not there is a chain of maximal order type. This need not be true 
in general. It turns out to be interesting to investigate well-founded orderings 
A such that A itself and also all the sets of the form {x € A | x < a} for 
an a € A have a chain of maximal order type. We call such orderings order- 
perfect. A motivation for this terminology arises in Sect.2. Of course, each 
finite ordering is order-perfect. In fact, each AWPO is order-perfect. This is a 
special case of Theorem 1.2 from Milner and Sauer [3]. Also, the special case 
of it for countable orderings is equivalent to a Theorem by D. Schmidt [5]. 

The aim of this paper is to prove order-perfectness for different kinds of 
orderings, namely for certain classes of lattices. To indicate the motivation, 
let us take an example. For a WPO A, the set | A of all downward closed 
subsets is well-founded (the converse is also true). Maximal chains in | A are 
in 1-1-correspondence with linear extensions of A. As is well known (see [2]), 
for each WPO A there exists a linear extension of maximal ordinal type, and 
so | A is order-perfect. This is a special case of our result saying that each 
well-founded completely distributive lattice is order-perfect. 


“The author is an Alfred P. Sloan Fellow and an NSF National Young 
Investigator 


R.L. Graham et al. (eds.), The Mathematics of Paul Erdés I, 427 
DOI 10.1007/978-1-4614-7254-4_25, 
© Springer Science+Business Media New York 2013 


428 Igor Kriz 


More generally, we present a necessary and sufficient condition for a well- 
founded modular lattice to be order-perfect. This condition is true for Heyting 
algebras (cf. [1]) as well as for countable modular lattices. We do not know 
if there is a natural common generalization of these two facts. In Sect. 4 
we show that one fairly natural candidate fails (the counter-example is a 
complete sublattice of Vect(R”)). 


Problem 1. Is there a class C of lattices which can be characterized 
by equations, includes both Heyting algebras and countable modular lat- 
tices and has the property that each well-founded lattice from C is order- 
perfect? 


To conclude this section, let us remark that we call our main concept 
order-perfectness to avoid confusion with the work of Milner and Pouzet [4] 
who consider perfectness of infinite comparability graphs in the usual graph- 
theoretical sense. 


2. Preliminaries: Orderings 


Definition 1. In this paper, ® resp. + denotes the natural (resp. ordinal) 
sum of ordinal numbers. For a well-ordered set K (isomorphic to an ordinal) 
a subset S C K is called closed if 


(TCS and T has a supremum in K) > supT € S. 


Now let A be an ordering. The symbol A designates the ordering obtained 
from A by adding a new greatest element. A chain (resp. strict, decreasing, 
strictly decreasing chain) in A is a sequence (da)acx (A € Ord) such that 
a < 8 implies ag < ag (resp. dg < Gp, Ga > 4B, Ga > ag). The length |c| of 
a chain c = (da)aer 1s the ordinal type of the set {ae A| B< a> ag < ag}. 
A chain (da)aer is continuous, if dsup 1 = sup{a; | i € I} for any I such that 
IU {supI} C ); in particular, the right hand supremum (the unique least 
upper bound) is required to exist. An antichain in A is a set of pairwise 
incomparable elements. 


The ordering A is well-founded (abb. WF) if it has no infinite strictly 
decreasing chain. A mapping y : A > B, where A, B are orderings, is called 
monotone resp. strictly monotone if it satisfies a < b + y(a) < y(b) (resp. 
a <b— g(a) < v(b)). For well-founded A we define 


p(A) = min{y | dy: A> ¥ strictly monotone} 


\(A) = sup{y | de: y > A strictly monotone}. 


Observe that \(A) < (A). We write \(A) instead of \(A) if this supremum 
as attained (i.e., if there is a maximum). 

Put) A={BCA]|ae B&b<a— be B}. Then | A is ordered by 
inclusion. For a € A we denote by |) a (resp.| a) the induced ordering on 
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{be A|b <a} (resp.{b € A| b<a}). An ordering A is called order-perfect 
if we have 
A(- a) = w( a) for any a € A. 


(In particular, the left hand sides are required to exist). We call A stratified, 
if for each a € A all maximal chains in |} A have the same ordinal length. For 
a poset A we say that a € A covers b € A if we have a > b and if there is no 
c€ Asuch that b<c<a. 

Let A be a partial ordering. A (finite or infinite) sequence (a;) in A is 
called bad if we have 


i<jra; £a;. 


The ordering A is called a well-partial-ordering (abb. WPO) if it contains no 
infinite bad sequence. We denote by A<% the set of all finite sequences in A 
and put 


Bad A = {(ao,...,@n) € AS” |i< jG Ga; £a;}. 


Thus, the empty sequence belongs to Bad A. A mapping y : Bad A > y + 
1(y € Ord) is called a character of A if we have 


p(Ax) < p(A) whenever Ax € Bad A. 
Put 


c(A) = min{y | da character y : Bad A > 7 + 1} 
d(A) = max{y | ¥ is isomorphic to a linear extension of A}. 


(Let us recall that an extension means a stronger ordering on the same set.) 
An extension of a WPO is always WPO (such a statement is not true for WF 
orderings). Thus, a linear extension of the WPO A is a well-ordering. The 
existence of a number d(A) was proved in [2]. 

Now let A be WF, a € A. Define the height h(a) of an element a € A 
by h(a) = A(- a). (The reader should be warned that the terminology in 
the literature differs and some authors use the word “height” in a different 
meaning.) We call A almost well-partial ordering (AWPO) if for any sequence 
(a;)iew in A there exist i < 7 with 


a, <a; or h(a;) > h(a;). 
For an ordering A and a set M C A put 
Au ={xEA| (Vy € M)y £ zc}. 
For a subset B C Ord put 
M(B) =sup{#+1| a e€ B}. 
Thus, M(B) is the least ordinal greater than all members of B. 


Proposition 1. Let A be WF. Then A is order-perfect if and only if all the 
numbers (A), A(d) a) (a € A) exist. 
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Proof. If the numbers (A), A() a) exist, then we have 
A a) = M({AC 8) | b < a}) 
MA) = M({A(Y a) | a € Af). 


We can easily check, however, that analogous recurrent relations hold 
for wu. 


Remark 1. Of course, not all WF orderings have to be order-perfect. Put, 
e.g., Aa = Bada with 


a<b= (Ac)a = be. 


Then we have 
A(As) =, (Ay) =a. 
Proposition 2. For each WPO A we have 
c(A)+1= pl) A) =AQ A) =d(A) +1. 
In particular, c(A) = d(A). 


Proof. Once we know that d(A) always exists (see [2] or Sect.3 below), the 
last two equalities are trivial. We present a proof of the first one: <: If y :) 
A — p(J A) is strictly monotone then the mapping 7 : Bad A > p({ A) 
given by w(a1...@n) = Y(A{a,...an}) is a character. >: If 


w:BadA>y+1 
is a character, then define y :| A> y+1 by 
p(B) = min{y(T) | T is a bad sequence in A \ B}. 


To see that y is strictly monotone, let B, C €| A, B C C. There is an 
element 


cEC\B. 
let T be a bad sequence in A \ C with 
y(C) = u(T). 
The concatenation Tc is a bad sequence, since C €| A. Thus, 


9(B) < ¥(Te) < WT) = 9(C). 


Theorem 1 (Milner and Sauer [3]). Each AWPO A is order-perfect. 
Lemma 1. Let I, J, K be well-ordered sets, K =IUJ. Then 


(a) cK) < cI) &e(J) 
(b) Ifmin Kk € J, |I| >w and I, J are closed in K then c(K) < cL) @c(J). 
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Proof. (a) Is a special case of a theorem proved in [2]. 
(b) Suppose the contrary. Without loss of generality, K = « is an ordinal. 
Let » be the largest limit ordinal with 


A<K 
By our assumptions, 
INXAFD & INXAFD. (x) 
There exist a < A, 6 < A such that 8 is sum-closed and 
a+P=d. 
Put 
B' ={atilie By}. 

We will show that 

One of the sets 11 A, JX has a maximum. (#) 


Indeed, suppose (**) false. Since both J and J are closed in K, the sets 
Ing, Iap 
are cofinal in 8’. We conclude that 
cI) 6 c(J) >a +28’ > K. 


Thus, («*) is proved. 
Now let, say, € = max(IM A). Define a mapping y: K > «+1 by 


Observe that we have 
R+1=¢(1) NG(J) 
e(e(1)) = e(L) and e(y(J)) = e(J). 
We conclude that 
(1) ® (J) > e(p(D)) & e(o(J)) > ev) UG) > K+L 


432 Igor Kriz 


3. Lattices: Positive Results 


Observation 1. Let A be a WF lattice. Then A is a complete lattice. 


Proof. Since A has finite meets and is well-founded, it has no empty meets. 


Conditions 1. We say that a WF ordering A satisfies CC (continuity 
condition), if there exists a chain (Gx) see fX(A) such that 


sup h(a;) = (A). 


A WF-ordering A is said to satisfy HCC if for each interval (b, a), b,a € A), 
such that X((b,a)) is sum-closed and of cofinality > w, satisfies CC. 

We say that a lattice A satisfies the condition WHA (weak Heyting 
algebra) if for any set I, |I| < |A|, and elements (b;)ier, a of A we have 


b)Aa= bj Aa). 
a aie i a) (+) 
In particular, the joins in question are required to exist. 


Theorem 2. A modular WF lattice is order-perfect if and only if it satisfies 
ACC. 


Proof will be given below. Until the end of the proof, A is always a well 
founded modular lattice. 


Lemma 2. Let (c;)i<, be a chain in A,a€ A, M = {i < \/c; £ a}. Then 
we have 


I(ciical S (ci A @)ical @ (i V @)iem|- (a) 
Moreover, we have 
I(ci)ical < (ci A @)ical ® (Gi V @)iem| (+) 
whenever 
|| > w. CPi 


Remark 2. The condition (+++) for (++) cannot be weakened. Indeed, 
consider A=w+2,rX}=wt+2,qG =i (i € A), a=w, I = {w+ I}, 
l(ci)icnl = w+ 2, |(c; Aa)iex| = Ww +1, |(c; Va)icx = 1 and equality occurs 
in (+). 


Proof. Let b = (b;)i<x be a chain in A. Put 
I(b) = {4 < K| (Vj < i)b; < by}. 
Obviously, 
|b] = c(1(0)). 
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By modularity of A, 
T((ci)ica) = T(x A @)icn) UI((G V @)iem)- 
Moreover, obviously, the right-hand sets are closed in the left-hand one and 


0 € I((e; Aa)icy). 


The statement of our lemma now directly follows from Lemma 1. 


Lemma 3. If we have b < a and there is a chain (c;) of length a in J) a then 
there are chains of lengths B,y in \} b, (b,a), respectively, such that 


Pay 20h 


Proof. Assume the chain (c;) in |} @ is maximal hence continuous. We 
distinguish two possibilities: 


(a) |(ci Vb | c; £ b)| > w. Then take the chains (c; Ab | c; 7 6), (c¢ Able; £ 6) 
and use Lemma 2(+-+). 

(b) |(c; Vb | c; £ 6)| < w. Then take the chain (c; A b | c; # 6) and the 
concatenation of b and (c; A b | c; £ 6) and use Lemma 2(+) together 
with the commutativity of addition of finite ordinals. 


Remark 3. Trying out a few examples, one can see that the cases (a), (b 
in the proof of Lemma 8 are really governed by different principles. A less 
careful analysis, however, would cause only a discrepancy by 1, which is still 
sufficient for the proof of Theorem 2. 


Lemma 4. Forb<aeéA we have 


A a) < AY 6) @ A((b, a). (+) 
Moreover, if (1) b) exists we have 
(Yb) + A((b, a) < AW a). (++) 


Proof. (++) is trivial, (+) follows from Lemma 3. 


Theorem 3._ If A (a modular WF lattice) is order-perfect then any interval 
(b,a), a,b € A, is order-perfect. 


Proof. First let X((b, a)) be sum-closed. We have a chain (a;) in J} a of length 
A({} a). Thus, there are chains in |} b,(b,a) of lengths 6, y, respectively, 
B®y >A a). By Lemma 4(++), 


ACU. b) + A((b, a)) S AC a). 
Since 8 < A(J) b), we have 
AY b) By SAW a). 
Since \((b,a)) is sum-closed, we have ((b,a)) < 7 and hence 
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X((b, @)) = 7. 

Now let a = A((b,a)) not be sum-closed. We proceed by induction on a. 
First realize that b < c < a > X((b,c)) < A((b,a)). In fact A((b, c)) exists 
by the inductional hypothesis; use Lemma 4(+-+). Now let 8 be the last 
summand of a (in the same sense as in the proof of Lemma 1). Let (a;) be 
a chain in (b, a) of length > a — 6 (the least number k s.t. 6 + 8 > a). Put 
C = Ag—g. We have A((c,a)) = 8: < is obvious and > follows from the fact 
that A((b, c)) < A((b,a)) and 

X((b, c)) ® A((e, a)) = A((b, a)) (+) 
(by Lemma 4(+)). Now A((b, )) exists by the inductional hypothesis and 
A((c, a)) exists by the first part of the proof. By (+), we have 

A((b, €)) + A({e,a)) = AC(b, a)) 
(since A((c,a)) = B). 
Lemma 5. Leta, b€ A. Then we have 


h(a) @ h(b) > h(aV b). 


Proof. Let (c;) be a chain in |} a V b of length a. By Lemma 3, there are 
chains in J) a(a,aV b) of lengths 6,7, respectively, such that 


BOy2a. 
As is well known, (a, aVb) is isomorphic to (a, a\b) via “Ab”. Thus, y < h(b). 


Remark 4. The discrepancy between natural and ordinal sum prevents us 
to obtain an exact result as in the finite case. 


Lemma 6. Let \((a,b)) be sum-closed and of cofinality w,a,b € A. Then 
(a,b) satisfies CC. 


Proof. Choose a; € (a,b) with \((a,b;)) 7 X((a, b)) and put a; = bo V...V bi. 
We have a; < b by Lemma 5 and the sum-closedness. 


Proof of Theorem 2. Necessity follows from Theorem 3. In order to prove 
sufficiency, we introduce the following statement 


S: If A is modular and satisfies HCC and a € A then (I) a) exists. 


This implies our result by Proposition 1, since neither modularity nor 
HCC is violated by adding a new greatest element. 
We shall prove S' by induction on h(a). 


Case 1. h(a) is not sum-closed. Then let a+ 8 = h(a), a, B < h(a), 6 the 
last summand of h(a). Then there exists ab < a with 


h(b) >a. 
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We may assume h(b) < h(a) (otherwise we replace a by b), hence XA(J b) 
exists. By Lemma 4, 


((b, a)) = 6. (+) 


The inductional hypothesis applied to a € (b,a) (a modular lattice satisfying 
HCC) yields 


A((b, a) = B. Ce 
Now by (+),(++), there exist strict chains 
(Ci)iea in Lb 
(di)ieg in (b, a). 
The desired chain is obtained by concatenation. 
Case 2. A= X() a) is sum-closed. Then either by HCC or by Lemma 6 we 
have a strict chain (Gi icern in \L a with 
(dq) ci) 77 A. 


By Lemma 4(+) and the sum-closedness of X we have 


lim = A((cj,¢;)) = 
JiJ>iecfr (( )) 
and thus 


li 49 Cj )) = 
jac ae 


by the inductional hypothesis. Now take an increasing sequence ig(a € cfd) 
such that 


_litm, (Cia tines) => 


Now if (CP? )ier(leing, ci,,,)) #8 @ strict chain, we obtain a chain of length X in 


\} a by concatenation. 


Theorem 4. /f A satisfies WHA and is WF then A satisfies CC (and hence, 
by hereditarity, also HCC). 


Proof. Let A = (0,1), \ = X(A). If there is a sequence (¢;)iecy, such that for 


B<cfxr 

VV Gq, <1, 

i€B 
the statement is proved. So assume the contrary. We will prove that following 
fact: 


T : For any c < 1 we have sup{h(b)|c £ b} <A. 


Really, since A is WF, we can write 


436 Igor Kriz 


c= Va 


icp 
where 6 < cfX and c are irreducible to sums of size < cfA. Now for any 
i € GB the set 


My = {h(b)ex £ b} 


has supremum < A : Otherwise there are an aq # ci, a € cfA, such that 
h(aq) 7 ». Thus, there is a y < cf such that 


Vox 


acer 
Thus, by WHA, 


VV (cj A Ga) = Ci. 


acy 


contradicting the assumed irreducibility of c;. Now 


sup sup M; < A. 
168 


However, T clearly enables us to choose a sequence (c;) with (+), contradict- 
ing the original assumption. 


Remark 5. We have shown that each WF WHA is order-perfect. On the 
other hand, each countable modular WF lattice is order-perfect, since it 
satisfies HCC by default. 


On might look for weakenings of these conditions. It is fairly easy to 
find an example of a distributive complete well-founded lattice which is not 
order-perfect: Take, for instance, the set 


A= {y: ur \ {0} > willar \ 97" (0)| < w& (Va € w1 \ {0}) 9(@) < a}. 
and put 
gy Sv iff (Va € wy \ {0})y(a) < Ya). 


This makes A a well-founded distributive lattice. A is a well-founded complete 
distributive lattice. Obviously, 


\(A) =W1 


while there is no uncountable chain. An observation of this kind has been 
first made in Zaguia [6]. 

This example teaches us that the infinite joins (at least of cardinality 
< |A|) indeed have to be “tightened up” in some way. On the other hand, 
the (finite) distributivity is obviously not needed. 

Note that WHA is equivalent to distributivity plus Lemma 2(+) for chains 
(b;). One might suggest modularity and Lemma 2(+-) for chains (b;). In the 
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next section we shall show, however, that this condition is not sufficient for 
order-perfectness. In fact, we shall find a complete well-founded sublattice 
of Vect(R“)) which is not perfect. This example, because of the strong 
condition on infinite joins is much more interesting than the one mentioned 
above. 


4. Lattices: A Negative Result 


Theorem 5. There exists a complete sublattice P of Vect(R“)) which is 
WF but not order-perfect. 


(In this paper, (R‘4)) means the free R-modul over A. Vect(X) stands 
for the lattice of all submodules of X.) 


Lemma 7. For any a < w, there exists a WPO Ba such that \(Ba) = w 
c(Ba) >a, Ba is stratified and 


{x € By|h(a) =n}, =n+1. (+) 


Proof. Let p : a > w be a bijection. Put By = {(a,k)|y(a) < k}. Now 
let a, k < (8,m) if and only if a < 6 and k < m. Clearly A(Ba) = w and 
Ba is stratified (since h(a,k) = k). For the same reason Ba, satisfies (+) 
(since y is a bijection). Now B, is WPO, since it is the intersection of three 
(linear) well-orderings: The lexicographical one prefering first coordinate, 
the lexicographical one prefering second coordinate and the ordering < 
given by 


(a,k) ~ (8,m) iff k < mor (k =m and 6 <a). 


Construction 1. In the sequel, we will assume that we are given disjoint 
sets Bala € w; satisfying the conditions of Lemma 7. Let 


R™=@YV,, dimV =21(+4+1)! 
1ew 
Let {u(a, Jla € wi,¢ € Ba, h(c) = i} be vectors of V; in general position. 
Each subset of cardinality < dim Vj; is linearly independent.) For a vector 
y y 


vu € V; define a subspace Ov C V;_1 as follows: 
If there exists a (in the sequel minimal with res pect to inclusion) set 


{vu(ai,¢1),.--,U(Qn, Cn) } 
such that v € (v(ay,¢c1),---,U(Qn,Cn)) (the linear hull) and 
n<2*(i+1)! 
then put 


Ov = (v(a;,d)|i <n and c; covers d). 
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Otherwise put 
Ov = V;_1.- 


Because of the general position of v(a,c),0 is defined correctly. Put for a 
subspace V CV; 


OV = (Ov\v EV). 
Now let 
PC @ VectV; 


1Ew 


be the set of all subspaces W satisfying 
a(w nV) CW. 
We will show that P is the desired lattice. 


Lemma 8. P is a meet-closed subset of Vect(R™?). 


Proof. It suffices to realize that a space W belongs to P if and only if it 
satisfies a set of conditions of the form 


aeEWwrbew. 


Such conditions are preserved by intersection. 


Lemma 9. Let vu € V;, (v(G1,d1),.-.,U(8m,dm)) > v. Then we have 
Ov C (v(8;, d)|t < m,d; covers d). 


Proof. First realize that if m > 2'(i +1)! then at least 2‘i! of the numbers 
(; are different (by Lemma 7(+) and the Dirichlet principle). Thus, the 
vectors u(8;,d) such that d; covers d are distinct and hence they generate 
Vi-1. If, on the other hand, n < 2(1 +7)! then (by the general position) 
{v(81,d1),.--,U(Bm,dm)} contains the set {v(a1,c1),.-.,U(Qn;Cn)} from 
the definition of Ov. 


Lemma 10. P is a complete sublattice of Vect(R™). 


Proof. Now it suffices to show that P is closed under joins. This, however 


follows from the relation 
a Vi) = VV avi 


which is an easy consequence of Lemma 7. 


Observation 2. Let B €| By. Then 
Ve = ({v(a,c)|c € B}) EP. 


Thus, P contains a chain of arbitrary countable length. On the other hand, 
it does not contain an uncountable one, since Vect(R“?) does not. 
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Lemma 11. Let V € P and let ani<k exist such that V Pp V;. Then there 
exist only finitely many different a with 


v(a,c) EV 
for some c € By, h(c) => k. 


Proof. Because of the stratifiedness of B,, V contains for each such a a vector 
v(a, d) with h(d) = 4, If there were infinitely many such ones then V; C V by 
the general position. 


Lemma 12. P is WF. 


Proof. Let (W;);e. be a strictly decreasing chain in P. Let, without loss of 
generality, Wo D V;,. Now let 


{a1,..-,n} = {a|(du(a,c))h(c) > k&v(a,c) € Wo}. 


Put By = {c| v(ai,c) € Wj}, i <n, j € w. By the definition of P clearly 
Bi €| Ba,. Moreover, we have 


12S (4 2 seo gene 
Since Ba,, is WPO, it holds without loss of generality that 
BSB es 
Since we have j 4 j’ with 
Win oM = WN GierVi, 


we have W; = We . A contradiction. 
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Dedicated to Paul Erdés 


Summary The pcf theorem (of the possible cofinability theory) was proved for 
reduced products [];—,, Ai/I, where & < minicn Ai. Here we prove this theorem under 
weaker assumptions such as wsat(I) < minic, Ai, where wsat(I) is the minimal 0 
such that & cannot be divided to 0 sets € I (or even slightly weaker condition). We 
also look at the existence of exact upper bounds relative to <r (<r-eub) as well as 
cardinalities of reduced products and the cardinals Tp(A). Finally we apply this to 
the problem of the depth of ultraproducts (and reduced products) of Boolean algebras. 


1. Introduction 


An aim of the pcf theory is to answer the question, what are the possible 
cofinalities (pcf) of the partial orders Pis2. di /I, where cf(\;) = Ai, for 
different ideals J on «. For a quick introduction to the pcef theory see [11], and 
for a detailed exposition, see [8] and more history. In §1 and §2 we generalize 
the basic theorem of this theory by weakening the assumption Kk < minjc, rj 
to the assumption that I extends a fixed ideal I* with wsat(I*) < minjc, Xi, 
where wsat(J*) is the minimal 6 such that « cannot be divided to @ sets ¢ I* 
(not just that the Boolean algebra P(«)/I* has no @ pairwise disjoint non 
zero elements). So §1, §2 follow closely [8, Ch.I=[10]], [8, II 3.1], [8, VIII 
$1]. It is interesting to note that some of (as presented in courses and see 
a forthcoming survey of Kojman) those proofs which look to be superseded 
when by [13, §1] we know that for regular @ < A, 6 < X => J stationary 
Se€ IA], S C {6 < AX: cf(d) = O}, give rise to proofs here which seem 
necessary. Note wsat(I*) < | Dom(J*)|* (and reg, (I*) < | Dom(J*)|*) so [8, 
I §1, §2, II §1, VII 2.1, 2.2, 2.6] are really a special case of the proofs here. 
During the 1960s the cardinalities of ultraproducts and reduced products 
were much investigated (see Chang and Keisler [1]). For this the notion 
“regular filter” (and (A, u)-regular filter) were introduced, as: if A; > No, 
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D a regular ultrafilter (or filter) on « then [],_,,Ai/D = (limsupp »;)". 
We reconsider these problems in §3 (again continuing [8]). We also draw a 
conclusion on the depth of the reduced product of Boolean algebras partially 
answering a problem of Monk; and make it clear that the truth of the full 
expected result is translated to a problem on pcf. On those problems on 
Boolean algebras see Monk [6]. In this section we include known proofs for 
completeness. 

Let us review the paper in more details. In 1.1, 1.2 we give basic definition 
of cofinality, true cofinality, pcf() and Je,[A\] where usually \ = (A; : i < k) 
a sequence of regular cardinals, /* a fixed ideal on « such that we consider 
only ideals extending it (and filter disjoint to it). Let wsat(/*) be the 
first 6 such that we cannot partition « to 6 I*-positive set (so they are 
pairwise disjoint, not just disjoint modulo J*). In 1.3, 1.4 we give the basic 
properties. In Lemmal we phrase the basic property enabling us to do 
anything: (1.5(«)) : liminf;«(A) > 6 > wsat(I*), [] \/I* is 0*+-directed; we 
prove that [] A/J<,[A] is A-directed. In 1.6, 1.8 we deduce more properties 
of (Je,[A] : \ € pef(A)) and in 1.7 deal with <j_,[x)-increasing sequence 
(fa: a@ <A) with no <j_,;y)j-bound in J] X. In 1.9 we prove pef(A) has a last 
element and in 1.10, 1.11 deal with the connection between the true cofinality 
of [],<,,4/D* and [],., wi/E when p; =: tef([],.,,41/Di) and D* is the 
E-limit of the D,’s. 

In 2.1 we define normality of \ for A: J<,[\] = Je,{A] + By and we 
define semi-normality: J<[\] = Je,{\] + {Ba : a < A} where Bo/Jey[A] is 
increasing. We then (in 2.2) characterize semi normality (there is a <j_,;y)- 
increasing f = (fa : a < A) cofinal in [] \/D for every ultrafilter D (disjoint 
to I* of course) such that tef([] A/D) = ) and when semi normality implies 
normality (if some such f has a <j_,;3)-eub). 

We then deal with continuity system @ and <jey{j)-mcreasing sequence 
obeying @, in a way adapted to the basic assumption (*) of 1.5. 

Here as elsewhere if min() > 0+ our life is easier than when we just 
assume limsup;.(A) > 0, [[A/I* is 0+-directed (where 6 > wsat(I*) of 
course). In 2.3 we give the definitions, in 2.4 we quote existence theorem, 
show existence of obedient sequences (in 2.5), essential uniqueness (in 2.7) 
and better consequence to 1.7 (in the direction to normality). We define 
(2.9) generating sequence and draw a conclusion (2.10(1)). Now we get some 
desirable properties: in 2.8 we prove semi normality, in 2.10(2) we compute 
cf([] \/I*) as max pef(\). Next we relook at the whole thing: define several 
variants of the pcf-th (Definition 6). Then (in 2.12) we show that e.g. if 
min(\) > 60+, we get the strongest version (including normality using 2.6, 
i.e. obedience). Lastly we try to map the implications between the various 
properties when we do not use the basic assumption 1.5 (*) (in fact there are 
considerable dependence, see 2.13, 2.14). 

In 3.1, 3.38 we present measures of regularity of filters, in 3.2 we present 
measures of hereditary cofinality of [] \/D: allowing to decrease \ and/or 
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increase the filter. In 3.4-3.8 we try to estimate reduced products of 
cardinalities [],_,,;/D and in 3.9 we give a reasonable upper bound by 
hereditary cofinality (< (°/D + hefp6([];—, Ar))<* when 4 > regg(D)). 

In 3.10-3.11 we return to existence of eub’s and obedience and in 3.12 
draw conclusion on “downward closure”. On Tp(f), starting with Galvin and 
Hajnal [2] see [8]. 

In 3.13-3.14 we estimate Tp(A) and in 3.15 try to translate it more fully 
to pef problem (countable cofinality is somewhat problematic (so we restrict 
ourselves to Tp(A) > w = p®°). We also mention &y-complete filters; (3.16, 
3.17) and see what can be done without relaying on pcf (3.20)). 

Now we deal with depth: define it (3.18, see 3.19), give lower bound (3.22), 
compute it for ultraproducts of interval Boolean algebras of ordinals (3.24). 
Lastly we connect the problem “does A; < Depth*(B;) for i < & implies 
us < Depth* ([];-,, Bi/D)” at least when ps > 2" and (Wa < p1)[|a|X° < yi], to 
a pef problem (in 3.26). This is continued in [16]. 

In the last section we phrase a reason why 1.5(*) works (see 4.1), analyze 
the case we weaken 1.5(*) to lim inf;«(A) > @ > wsat(I*) proving the pseudo 
pef-th (4.3). 


2. Basic pcf 


2.1. Notation 


I, J denote ideals on a set Dom(I), Dom(./) resp., called its domain (possibly 
Uae, A C Dom /). If not said otherwise the domain is an infinite cardinal 
denoted by « and also the ideal is proper i.e. Dom(Z) ¢ I. Similarly D denotes 
a filter on a set Dom D; we do not always distinguish strictly between an ideal 
on « and the dual filter on k. Let \ denote a sequence of the form (A; : i < k). 
We say ) is regular if every A; is regular, min = min{); : i < K} (of course 
also in \ we can replace « by another set), and let |] A= I], As; usually we 
are assuming A is regular. Let [* denote a fixed ideal on «. Let IT = P(K)\I 
(similarly Dt = {A Cr: «\A € D}), let 


liminf A= min{p: {i< «2s <p} eT*} and 
limsupA = min{yw:{i<«:A; > pu} eT} and 
I 


atomyA = {yw: {4: A; = p} € It}. 


For a set A of ordinals with no last element, J}? = {B C A: sup(B) < 
sup(A)}, i.e. the ideal of bounded subsets. Generally, if inv(X) = sup{|y| :E 
y[X, y]} then invt (X) = sup{l|y|+ :& LX, y]}, and any y such that / y[X, y] 
is a witness for |y| < inv(X) (and |y| < inv*(X)), and it exemplifies this. Let 
As[\| = (AX: a < 0) = (Ado [A] : a < 0) be defined by: AX = {i << Kw: > 
a}. Let Ord be the class of ordinals. 
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Definition 1. (1) For a partial order* P: 


(a) P is \-directed if: for every A C P,|A| < X there is gq € P such that 
Leavy <q, and we say: q is an upper bound of A; 

(b) P has true cofinality \ if there is (pa : a@ < A) cofinal in P, i.e.: 
NacpPa < pp and Vg € PQ) < Pal [and one writes tcf(P) = A 
for the minimal such X/] (note: if P is linearly ordered it always has a 
true cofinality but e.g. (w,<) x (wi, <) does not). 

(c) P is called endless if Vp € Paq € Plq > p| (so if P is endless, in clauses 
(a), (b), (d) above we can replace < by <). 

(d) AC P is a cover if: Vp € Pag € Alp < q]; we also say “A is cofinal in 
Pe, 

(e) cf(P) = min{|A]: A C P is a cover}. 

(f) We say that, in P,p is a lub (least upper bound) of A C P if: 


(a) p is an upper bound of A (see (a)) 
(8) If p' is an upper bound of A then p < p’. 


(2) If D is a filter on S,as (for s € S) are ordinals, f,g € [[,eg @s, then: 
f/D < g/D, f <p g and f < g mod D all mean {s € S: f(s) < 
g(s)} € D. Also if f,g are partial functions from S to ordinals, D a 
filter on S then f < g mod D means {i € Dom(D) : 1 ¢ Dom(f) 
or f(t) < g(t) (so both are defined)} belongs to D. We write X = A 
mod D if Dom(D) \ [(X \ A) U(A\ X)] belongs to D. Similarly for <, 
and we do not distinguish between a filter and the dual ideal in such 
notions. So if J is an ideal on & and f,g € J], then f <g mod J iff 
{i <K:Af(i) < g(t)} € J. Similarly if we replace the as’s by partial 
orders. 

(3) For f,g : S —  Ordinals, f < g means /\,c5 f(s) < g(s); similarly 
f < g. So (JA, <) is a@ partial order, we denote it usually by [] A; 
similarly || f or [],<, f(i)- 

(4) If I is an ideal on k,F C “Ord, we call g € “Ord an <y-eub (exact 
upper bound) of F if: 


(a) g is an <;-upper bound of F (in “Ord) 

(8) Ifh € “Ord,h <; Max{g,1} then for some f € F,h < max{f,1} 
mod J. 

(y) If ACK, AAO modl and[feFsflA=r04,ie {iE A: 
f(t) FO} € I] then g | A=z 0a. 

(5)(a) We say the ideal I (on &) is 0-weakly saturated if « cannot be divided 

to 6 pairwise disjoint sets from I+ (which is P(K) \ I) 

(b) wsat(I) = min{@: I is 0 — weakly saturated} 


“actually we do not require p < q < p > p=q so we should say quasi partial 
order 
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Remark 1.1A. 


(1) Concerning 1.1(4), note: g’ = Max{g,1} means g’(t) = Max{g(i), 1} for 
each i < «; if there is f € F, {i < «: f(a) = 0} © I we can replace 
Max{g, 1}, Max{f,1} by g, f respectively in clause (G) and omit clause 
(7): 

(2) Considering [],-,, f(i), <r formally if (Ji) f(i) = 0 then [J,_, f(@) = 9; 
but we usually ignore this, particularly when {7 : f(z) = 0} € I. 


Definition 2. Below if T is “a filter disjoint to I”, we write I instead T. 
(1) For a property T of ultrafilters: 


pefp(A) = pef(A,T) = {tef(] | A/D): D is an ultrafilter on « satisfying 
r} 


(so X is a sequence of ordinals, usually of regular cardinals, note: as D 
is an ultrafilter, [] A/D is linearly ordered hence has true cofinality). 
(1A) More generally, for a property T of ideals on & we let pefp(A) = 

{tcf(T] A/J) : J is an ideal on « satisfying T such that T]\/Jhas true 
cofinality}. Similarly below. 

(2) Jex[A,T] = {B C k: for no ultrafilter D on «. satisfying T to which B 
belongs, is tef([] \/D) > A}. 

(8) Jarl, DP} = Jea+[A, ry. 

(4) pefp(A, 1) = {tcef([] A/D) :D a filter on & disjoint to I satisfying T}. 

(5) If B € I*,pef,(A | B) = pefrycnpy(A) (so if B € I it is 0), also 
Jen(A | B, I) C P(B) is defined similarly. 

(6) If 1 =I* we may omit it, similarly in (2), (4). 


If{=Typ ={D: D a filter on « disjoint to I*} we may omit tt. 
Remark. We mostly use pef(X), Jey [AI]. 
Claim 1.3. 


(0) ([]\,<,z) and ([] \,<,) are endless (even when each ; is just a limit 
ordinal); 
(1) min(pef;(A)) > liminf;(A) for A regular; 
(2) (i) If By C By are from I* then pef;(A | Bi) C pef,(A | Ba); 
(ii) If 1 C J then pef (A) C pef;(A); and 
(iii) For B,, By C & we have pef ;(A | (B1 U B2)) = pef (A | Bi) Upef( | 
Bz). Also_ 
(i v) Ae Jex[rA l (Bi UB2)] © ANB, € TzZi|d | Bi] & ANB € TX | Bo] 
(v ) If Ay, Ag e€ It »ANAg = = 0, A,UA,2 = K, and tcf([] t Ap, <j) = d for 
€=1,2 then tof], <;) = »; and if the sequence f = (fa : a < A) 
witness both assumptions then it witness the conclusion. 
(3) (i) If By C By Ck, B, finite and \ regular then 


pef(\ | Bz) \ Rang(A | Bi) C pef;(A | (Bo \ Bi)) € pef (A | Ba) 
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(ii) If in addition i € By = A; < min(Rang|A | (Bo \ B:1))), 
then pef;(X | By)\Rang(X t Bi) = pef; (A | (Ba\B)). 
(4) Let A be regular (i.e. each A; is regular); 


(i) If 6 = liminf; \ then [] A/J is 6-directed 

(ii) If @ = liminf, J is singular then [] \/J is @+-directed 

(iii) If 6 = liminf, X is inaccessible (i.e. a limit regular cardinal), the 
set {i < &: \; = 6} is in the ideal J and for some club E of 8, 
{i<«:2; € E} € I then J] X/I is 6+-directed. We can weaken the 
assumption to “I is not weakly normal for \” (defined in the next 
sentence). Let “J is not medium normal for (0, A)” mean: for some 
he JJ), for no j < 0is fi kh: AX, < O05 h(i) <j} = mod J; 
and let “J is not weakly normal for (9, )” mean: for some h € J] A, 
for no ¢ < liminf;(A) = 0, is fi< kK: < OSA) < Gert. 

(iv) If {4 : A; = 06} = K mod I and J is medium normal for then 
([[ A, <r) has true cofinality 0. 

(v) If [[A/Z is 6-directed then cf([] A/Z) > 6 and min pef;(]] A) > 4. 

(vi) pef;(A) is non empty set of regular cardinals. [see part (7)] 


(5) Assume 2 is regular and: if 6’ =: limsup;(A) is regular then I is not 
medium normal for (6’, A). Then pef;(A) ¢ (limsup;(A))*; in fact for 


some ideal J extending J, [] \/J is (lim sup,(A))+-directed. 
(6) If D is a filter on a set S and for s € S, a, is a limit ordinal then: 


(i) ef([],eg <p) = cef([[,e5 cf(as), <p) = cf([],eg(as,<)/D), and 
(ii) tef([],¢g as, <p) = tef([],<g(cf(as), <p)) = tef([],<g(as, <)/D). 


In particular, if one of them is well defined, then so are the others. This 
is true even if we replace a, by linear orders or even partial orders with 
true cofinality. 

(7) If D is an ultrafilter on a set S, A; a regular cardinal, then 0 =: 
tef([],¢g As; <p) is well defined and @ € pcf({As : s € S}). 

(8) If D is a filter on a set S, for s € S, \, is a regular cardinal, S* = {\, : 
s € S} and 


E={B: BCS* and{s:’A,€B}e D} 
and A, >|S| or at least \, > |{t: Az = As}| for any s € S then: 


(i) E is a filter on S*, and if D is an ultrafilter on S then FE is an 
ultrafilter on S*. 
(ii) S* is a set of regular cardinals and 
ifse S=> 2, >|S| then (VA € S*)A > |S", 
Gii) F={f ¢ [].eg As: 8 =t > f(s) = f(t)} is a cover of [],¢.6 As, 
(iv) cf([],eg As/D) = cf([] $*/E) and tef([[,¢5 As/D) = tef([] S*/£). 


(9) Assume J is an ideal on «, F C “Ord and g € “Ord. If g is a <;-eub of 
F then g is a <;-lub of F. 
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(10) sup pef (3) < [T/T 

(11) IfJ is an ideal on S and ([],¢g as, <z) has true cofinality \ as exemplified 
by f = (fa : a < ) then the function (a, : s € S) is a <; -eub (hence 
<,-lub) of f. 

(12) The inverse of (11) holds: if J is an ideal on S and f, € SOrd for a < A= 
cf(A), (fa 1 @ < A) is <;-increasing with <; — eub f then tef([], f(i), <r 
) = tef([] cf[f@], <1) = A. 

(13) If I C J are ideals on « then 
(a) wsat(I) > wsat(J) 
(b) lim inf;(A) < lim inf 7() 
(c) If A = tef([],-,, A) <r) then A = tef([] 


(14) If fi, fe are <;-lub of F then f; =; fo. 


1<K Xi; <u) 


Proof. They are all very easy, e.g. 


(0) We shall show ([] A, <,) is endless (assuming, of course, that J is a proper 
ideal on «). Let f € J] A, then g =: f +1 (defined (f +1)(y) = f(y) +1) 
is in [] A too as each Aq being an infinite cardinal is a limit ordinal and 
f<g mod J. 


(5) Let 6’ =: lim sup;(A) and define 
J=:{ACk: for some 0 < 6’, {i<«:2; > 0 andi € A} belongs to J}. 


Clearly J is an ideal on « extending J (and « ¢ J) and limsup;(A) = 


lim inf 7(A) = 6’. 
Case 1: 6’ is singular 
By part (4) clause (ii), []\/J is (6’)*+-directed and we get the desired 
conclusion. 
Case 2: 6’ is regular. 
Let h € [JA witness that “J is not medium normal for (6’,\)” and let 


J* ={ACk: for some j < 6’ we have {i€ A: h(i) <j}=A_ mod J}. 


Note that if A € J then for some @ < 6’, A’ =: {1€ A: 6, > 0s ET 
hence the choice j =: 6 witness A € J*. So J C J*. Also J* C P(x) by its 
definition. J* is closed under subsets (trivial) and under union [why? assume 
Ao, Ai € J*, A= AgUA1; choose jo, j1 < 6’ such that A, =: {i © Ap: h(t) < 
je} = Age mod I, so j =: max{jo, ji} <6 and A’ = {ie A: h(i) <j} =A 
mod J; so A € J*]. Also x ¢ J* [why? as h witness that J is not medium 
normal for (6’, \)]. So together J* is an ideal on « extending J. Now J* is not 
weakly normal for (8, A), as witnessed by h. Lastly [] \/J* is (6’)+-directed 
(by part (4) clause (iii)), and so pef (A) is disjoint to (6’)*. 

(9) Let us prove g is a <;-lub of F in ("Ord, <;). As we can deal separately 
with I+ A, I+ (K\A) where A =: {7 : g(i) = 0}, and the later case is trivial 
we can assume A = §. So assume g is not a <;-lub, so there is an upper 
bound g/ of F, but not g <; g’. Define g” € “Ord: 
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np. — JO if g(t) < g' (i) 

PO= {ging cot 
Clearly g” <; g. So, as g in an <; - eub of F for J, there is f € F' such that 
g” <r max{f,1}, but B =: {i : g'(t) < g(t)}} #0 mod I (as not g <7 g’) 
sog’ | B=g" | B <; max{f,1} | B. But we know that f <; g’ (as g’ is 
an upper bound of F'’) hence f | B <; g’ | B, so by the previous sentence 
necessarily f [| B =; 0p hence g’ | B =; 0p; as g’ is a <y-upper bound of F 
we know [f’ ¢ F => f’ | B =r; Og], hence by (7) of Definition 1(4) we have 
g | B =; 0g, a contradiction to B ¢ I (see above). 13 


Remark 1.8A. In 1.3 we can also have the straight monotonicity properties of 
pef,(][A,1) in P and in J. 
CLAIM 1.4: 


1) J<a{A] is an ideal (of P(x) ie. on &, but the ideal may not be proper). 

2) IfA <p, then Jea[\] C Jey] 

3) If d is singular, J<,[A] = Jey+[A] = Jal] 

4) If \ ¢ pcf(A), then we have Je,[A] = Jc aA. 

5) If AC K, A ¢ J aglX |; and fa € € JA , A 5 hes :a< X) is <Tey[X]7 
increasing cofinal in ([] \ [| A)/Je,{A] then Z € J<y[A]. Also this holds 
when we replace Jz [{\] by any ideal J on x, I* C J C Jey. 

6) The earlier parts hold for Jz,[A,I] too. 


Proof. Straight. 
Lemma 1. Assume 
(*) is regular and 


(a) mind > 6 > wsat(I*) (see 1.1(5)(b)) or at least 
(8) liminf;«(A) > 6 > wsat(I*), and [] \/I* is 6+ -directed.** 


If X is a cardinal > 0, and « ¢ Jey{A] then (T] ds <3 [3)) is -directed 
(remember: Jey{A] = Jey{A, I*]). 


Proof. Note: if f € [JA then f < f+1€]]\, (ie. (TI), <,j,[x}) is endless) 
where f +1 is defined by (f +1)(i) = f(i)+1. Let F CJJA,|F| < A, and we 
shall prove that for some g € [] \ we have (Vf € F)(f <g mod Je,{))), this 
suffices. The proof is by induction on |F'. If |F'| is finite, this is trivial. Also 
if |F| < 0, when (a) of (*) holds it is easy: let g € J] be g(i) = sup{f (i) : 
f € F} < Xj; when (8) of («) holds use second clause of (3). So assume 


**note if cf(0) < @ then “O*-directed” follows from “0-directed” which follows 
from “lim inf;+(X) > 0, i.e. first part of clause (3) implies clause (8). Note also ee 
if clause (a) holds then I] \/I* is 0*-directed (even ([] \,<) is 0*-directed), s 
clause (a) implies clause (). 
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|F| = ,0<pw<Asolet F={f2:a < py}. By the induction hypothesis we 
can choose by induction on a < 1, f1 € [] A such that: 


(a) fo < fa mod J<y[A] 
(b) For 6 < a we have f3 < f{ mod Jes(X: 


If uz is singular, there is C C yu unbounded, |C| = cf(u) < pm, and by 
the induction hypothesis there is g € [JA such that for a € C, fi < g 
mod Je,{A]. Now g is as required: f° < fi < fnin(C\a) <g mod Jey{A]. So 
without loss of generality yw is regular. Let us define A? =: {i < K: 2; > |e|} 
fore <0,soe<¢<0= A? C Ai ande <0 => A? =« mod I*. Now we 
try to define by induction on € < 0,g-,a- = a(e) < wand (BS : a < yu) such 
that: 

(i) ge € [JA 

(ii) For « < ¢ we have g- | At < ge T AZ 

(iii) For a < p let BE =: {i< «: f)(4) > ge(i)} 

(iv) For each € < 0, for every a € [ae41, 1), BE # B&t! mod Jey[)]. 

We cannot carry this definition: as letting a(*) = sup{a, : € < 9}, then 
a(*) <p, since p = cf(1) > 6. We know that Bi.) Az, # Br nN At, 


for a < @ (by (iv) and as At,, =« mod J* and I* € Jga[A}) and Be, G & 
(by (iii)) and fe < ¢ > BM AE C BE] (by (ii)), together (AZ, 9 
(Bea) \ Beto) :€ <6) is a sequence of 6 pairwise disjoint members of (I*)T, 
a contradiction*** to the definition of 6 = wsat(J*). 

Now for ¢ = 0 let g; be f¢ and a, = 0. 

For ¢ limit let g-(?) = Uce. g(t) for i € Az and zero otherwise (note: 
ge € []A ase < 6,2; > for i € A® and 4 is a sequence of regular cardinals) 
and let a, = 0. For e = ¢+1, suppose that g¢ hence (BS : a < p) are defined. 
If BS € Jey{A] for unboundedly many a < yu (hence for every a < 1) then ge 
is an upper bound for F mod Jz,{A] and the proof is complete. So assume 
this fails, then there is a minimal a(¢) < ps such that Beis ¢ Jey[A). As 
Boe ¢ Jex[A], by Definition 2(2) for some ultrafilter D on k disjoint to 
Jex[A] we have Bee € D and cf([] A/D) > 2. Hence {f!/D : a < p} has 
an upper bound h;/D where h, € [J X. Let us define g- € [] A: 


ge(t) = Max{ge(2), he (2) }. 


Now (i), (ii) hold trivially and Bé is defined by (iii). Why does (iv) hold 
(for ¢) with ag41 = a =: a(e)? Suppose a(e) < a < pw. As for < fa 


*“*in fact note that for no Bz C K(e < 6) do we have: Bz 4 Be+1 mod I* and 
E<C<03 B.N Ae © Be where Ag = & mod I” (e.g. Ac = AZ) 
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mod Je{A] clearly Boe C BS mod Jey{A]. Moreover Je,[A] is disjoint to 


D (by its choice) so Bee € D implies BS € D. 
On the other hand BS is {i < «: fi(i) > ge(i)} which is equal to 
{ie X: fi (i) > ge(i), he(4)} which does not belong to D (he was chosen such 
that f? <h- mod D). We can conclude Bé, ¢ D, whereas BS € D; so they 
are distinct mod Jz,{] as required in clause (iv). 
Now we have said that we cannot carry the definition for all ¢ < 0, so we 
are stuck at some €; by the above ¢ is successor, say € = ¢ + 1, and ge is as 


required: an upper bound for F’ modulo Jzy{AJ. 15 


Lemma 2. : /f (*) of 1.5, D is an ultrafilter on K disjoint to I* and X = 
tef([[A,<p), then for some B € D, ([JA 1 B,<j_.)x)) has true cofinality 2. 


(So Be Jex[A\Jeall by 1.4(5).) 


Proof. By the definition of Je,{A] clearly we have DN Je,{A] = 0. 

Let (fa/D :a < 2) be increasing unbounded in [J \/D (so fa € [])). 
By 1.5 without loss of generality (V8 < a)(fs < fa mod Jey[A)). 

Now 1.6 follows from 1.7 below: its hypothesis clearly holds. If \ 2, Ba = 
mod D, (see (A) of 1.7) then (see (D) of 1.7) JA D = @ hence (see (D) of 
1.7) g/D contradicts the choice of (fo/D:a< A). So for some a < X, Ba € 
D; by (C) of 1.7 and 1.4(5) we get the desired conclusion. 1.6 


Lemma 3. : Suppose (*) of 1.5, cf(A) > 9, fa €T]A, fa < fg mod J<y[A] 
fora <8 <X, and there is no g € [JX such that for every a <r, fa < g 


mod Je,[A]. Then there are By (for a < X) such that: 


(A) Ba C « and for some a(*) < A: a(*) <a <A> Ba ¢ Jey[D] 

(B) a<B=> Ba © Bg mod Je,{A (i.e. Ba\Ba € Jex[N) 

(C) For each a, (fg | Ba : B < X) ts cofinal in (TTX T Bas <32, 15) 
(better restrict yourselves to a > a(x) (see (A)) so that necessarily 
Ba ¢ JexlAl);- _ = 

(D) For some g € TIA, Agcy fa < g mod J where J = Jey[A] + {Ba : 
a < A}; in fact 

(D)+ For some g € [JA for every a < , we have t fa <g mod (Jey[A] + 

Ba), in fact Ba = {i < &: falt) > g(a)} 

(E) Ifg<g' € J), then for arbitrarily large a < : 


{i<K: [9(t) = falé) + 9'(t) 2 folt)|} =*% mod J<y[dJ 


(hence for every large enough a < X this holds) 
(F) If 5 is a limit ordinal < A, fs is @ <j, ;y)-lub of {fa sa < 5} then 
Bs is a lub of {By : a < 5} in P(k)/JealAl- 


+Of course, Ba = & mod Je,(A), this becomes trivial. 
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Proof of 1.7. Remember that for e < 0, At = {i < Kw: A; > lel} so AE =k 
mod J* and « < ¢ = Ag © At. We now define by induction on « < 8, 
Ge, @(E) < A, (BE : a < A) such that: 


) 
(i) For C<e, ge TADS ge 1 AY 
(iii) BE =: {1 EK: fol(t) > ge(i)} - 
(iv) If a(e) <a <A then BE ¢A Bet! mod Jey{] 
For ¢ = 0 let ge = fo, and a(e) = 0. 
For ¢ limit let g-(?) = Uce, ge(t) if i € Az and zero otherwise; now 
IC <e= 9¢ 1 Ae < ge TAZ] 


holds trivially and g- € [J] as each ); is regular and [i € At © Xj > €¢]), 
and let a(e) = 0. 

For ¢ =¢+1, if {a < A: BS € Jey[A}} is unbounded in 4, then ge is a 
bound for (fa :a@< 2) mod Je,{A], contradicting an assumption. Clearly 


a<B<rA\=>BOCBS mod Jen 


hence {a < A: BS € Jey[A}} is an initial segment of . So by the previous 
sentence there is a(¢) < A such that for every a € [a(e), A), we have BS ¢ 
Je [A] (of course, we may increase a(<) later). If (BS : a < A) satisfies the 
desired conclusion, with a(¢) for a(*) in (A) and ge for g in (D), (D)* and 
(E), we are done. Now among the conditions in the conclusion of 1.7, (A) 
holds by the definition of BS and of a(e), (B) holds by BS’s definition as 
a<B=> fa < fg mod Jey{X], (D)* holds with g = ge by the choice of BS 
hence also clause (D) follows. Lastly if (E) fails, say for g’, then it can serve 
as ge. Now condition (F) follows immediately from (iii) (if (F) fails for 6, then 
there is BC BS such that \,-; BS C B mod Je,[X] and BS\B ¢ Jz,(X)); 
now the function g* =: (g¢ [ («\B)) U (fs | B) contradicts “fs is a <j_. yy 


lub of {fa : a < d}”, because: g* € I] (obvious), =(fs < g* mod Je,[A]) 


[why? as BS\B ¢ J<,[A] and g* | (B§\B) = g¢ | (B§\B) < fs | (B3\B) by 
the choice of BS], and for a < 6 we have: 


fol B<y.,;y fo 1 B =g |B and 
fa T (K\B) <3 2.13) 9¢ | (K\B) = g* T (K\B) 


(the <j_,)3) holds as («\B) BS € Jea{A] and the definition of BS). So 
only clause (C) (of 1.7) may fail, without loss of generality for a = a(e). Le. 
(fa t Bois) : 8 < A) is not cofinal in ([] A | Boas <Jea th): As this sequence 
of functions is increasing w.r.t. <j_.)3), there is ha € [](A | Bee) such that 


for no B < Ado we have hy < fg | Boe) 


mod Jey]. Let hy = heW0(.\ BE) 
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and ge € [JX be defined by g-(i) = Max{ge(2), hi (i)}. Now define B® by (iii) 
so (i), (ii), (iii) hold trivially, and we can check (i v). 

So we can define g-,a(e) for e < 6, satisfying (i)—(iv). As in the proof of 
1.5, this is impossible: because (omen beene cf(A) = A > 6) letting a(*) = 
Uzco a(e) < A we have: (By, () NM Ag: e <¢) is C-decreasing, for each ¢ < 0, 


and At =« mod I* and Bry. » # Be mod Je,[A] so (B eG ae 41\ Bete | 


é < 6) is a sequence of 0 Sere aot members of (Jz {A])* hence of 
(I*)* which give the contradiction to (*) of 1.5; so the lemma cannot fail. 


1.7 


Lemma 4. : Suppose (*) of 1.5. 
(1) For every B € Jey{A]\Jea[A], we have: 


{JA | B,<j2,13)) has true cofinality \(hence X is regular). 


(2) If D is an ultrafilter on «, disjoint to I*, then cf(|]J A/D) is min{X : 
DnJeaD] #0}. _ : 

(3) (i) For X limit Jey [A] = pen I<nlA] hence 
(ii) For every , Jex[A] = ner JulAl- : 

(4) Jeald) A Seal) iff Jer A)\J<alAl 4 0 iff d € pef(A). 

(5) Jex[Al/ FerAl is A-directed (i.e. if By € Jed] for y < y*,y* <A then 
for some B € J<y|A] we have By CB mod Je,[A] for every y < *.) 


Proof. (1) Let 
J={BCK: Be Je[N or BE Je[A\JealA] and 
[A A | B,<j.,)x)) has true cofinality A}. 


By its definition clearly J C J<x[A]; it is quite easy to check it 
is an ideal (use 1.3(2)(v)). Assume J # J<)[\] and we shall get 
a contradiction. Choose B € J<,[A]\J; as J is an ideal, there is 
an ultrafilter D on « such that: DO J = @ and B € D. Now if 
tef([J A/D) > At, then B ¢ Je [A] (by the definition of J<)[A]); 
contradiction. On the other hand if F C J]A,|F| < then there 
is g € [JA such that (Vf € F)(f < g mod Je,[A}) (by 1.5), so 
(Vf € F)[f <g mod D] (as Je,[A] C J, DNJ = 9), and this implies 
ef([] A/D) > 2. By the last two sentences we know that tef([] \/D) is 
d. Now by 1.6 for some C € D, (T](A | C), <y-y[))) has true cofinality 
A, of course CNB C C and CNB € D hence CNB ¢ Jey{X). Clearly if 
C'CC,C' € Jey{A] then also (TTA | C’,<5_,)3]) has true cofinality 
X, hence by the last sentence without loss of generality C' C B; hence 
by 1.4(5) we know that C € J<,[A] hence by the definition of J we 
have C' € J. But this contradicts the choice of D as disjoint from J. 
We have to conclude that J = J< [A] so we have proved 1.8(1). 
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(2) Let \ be minimal such that DN J<y[A] 4 0 (it exists as by 1.3(10) 


Jeqqx)+ lA] = P(«)) and choose BE DN JeylA]. So [u <A> BE 
J<,[A]] (by the choice of A) hence by 1.8(3)(ii) below, we have B ¢ 
J<x{\). It similarly follows that DO J<,[A] = 0. Now (T]A } By <4: 
) has true cofinality A by 1.8(1). As we know that B € DN Je<)[Al, 
and Je,{A]  D = 0; clearly we have finished the proof. 

(3)(i) Let J =: Uc, J<u[A]- Now J is an ideal by 1.4(1)+(2) and ([] A, <y) 
is A-directed; i.e. ifa* < AX and {fa : a <a*} CTT]A, then there exists 
f € JA such that 


(Va <a*)(fa < f mod J). 


[Why? if a* < @* as (*) of 1.5 holds, this is obvious, suppose not; » 
is a limit cardinal, hence there is u* such that a* < p* < A. Without 
loss of generality ja*|+ < p*. By 1.5, there is f € [[A such that 
(Va <a*)(fo < f mod Je,«[A]). Since Je,~[A] C J, it is immediate 
that 


(Va <a*)(fa < f mod J).] 


Clearly Une, J<ulA] © J<alA] by 1.4(2). On the other hand, let 
us suppose that there is B € (JealAl\ pea Je,[A]). Choose an 
ultrafilter D on « such that B € D and DN J = 9. Since ([]\, <z) 
is \-directed and DN J = 0, one has tef([]\/D) > 2, but B € 
DN Je{A], in contradiction to Definition 2(2). 

(3)(ii) If A limit—by part (i) and 1.4(2); if A successor—by 1.4(2) and 
Definition 2(3). 

(4) Easy. 

(5) Let (fg :a@ <A) be <j_.13)4(«\B,)-increasing and cofinal in |] A (for 
y < y*). Let us choose by induction on a < \ a function fa € []A, 
as a <j_,;x)-bound to {fg :B <a}U{fa:7 <7}, such fa exists 
by 1.5 and apply 1.7 to (fa : a < A), getting (BY: a < A), now Bi, 
for a large enough is as required. 1.8 


2.2. Conclusion 


If (*) of 1.5, then pef(A) has a last element. 


Proof. This is the minimal A such that « € J<)|[A]. A exists, since A* =: 
IITA € fA: & © Jey [A]} A O and by 1.4(2); and  € pef(A) by 1.8(4) and 
A = max pcef (A) by 1.4(7)+ 1.8(4). 1.9 


CLAIM 1.10: Suppose (*) of 1.5 holds. Assume for j < 0, D; is a filter on 
« extending {«\A:Ae€I*}, Fa filterong and D*={BCK:{j<a0:BeE 
D;} € E} (a filter on «). Let yu; =: tef([] A, <p,) be well defined for j < ¢, 
and assume further y; > 0+ (where @ is from (*) of 1.5). 
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Let 
A= tef(] | r, <p*), b= tef(] | bj, <z). 
j<o 
Then A = p (in particular, if one is well defined, than so is the other). 
Proof. Wlogo > @ (otherwise we can add pu; =: uo, Dj; =: Do for j € 6\ a, 
and replace o by 6 and E by E'= {AC @:ANo€ E}). Let (fi : a < py) 
be an <p,-increasing cofinal sequence in ([] A, <p,). 

Now ¢ = 0, 1, for each f ¢ J], define Ge(f) € Te, ui by Ge(f)(J) = 
min{a < yw; :if€ =1then f < f2 mod D; and if ¢ = 0 then: not fi <f 
mod D;} (it is well defined for f €¢ [] \ by the choice of (f% : a < jj)). 

Note that for ft, f? € [] A and @ < 2 we have: 

Par mod DSBs yy St<Ke OP Oyen 
@ Aft, f?) = {ji <0: Bf, f?) ¢ Dj} EE 
& for some A € E, for every i € A we have f! <p, f? 
= for some A € £ for every i € A we have 
Ge(f*)(i) < Ge(f?)@) 
ca Go(f') < Go(f?) mod E. 
So 


®1 Gp is a mapping from ([] X, < p+) into (Tees [t;, <<) preserving order. 
Next we prove that 2 
@2 For every g € |], j for some f € []A, we have g < Go(f) mod E. 


[Why? Note that min{y; : j < o} > ot > OF and J<o[A] C J<o[A]. By 1.5 
we know ([] A, <j). ox) is ot-directed, hence for some function f € J] 1: 


(*)1 For j < o we have Les <f mod J<,[A]. 


We here assumed o < jij, hence J<g[A] C Ie [A] (by 1.4(2)) but Je, by 


is disjoint to D; by the definition of J<,, [A] (by 1.8(2)+1.3(13)(c)) so together 
with (*)1: 
(x)o For j <9, fig < f mod Dj. 
So by the definition of Go for every 7 < o we have g(j) < Go(f)(j) 
hence clearly g < Go(f).] 
®3 For f € [JA we have Go(f) < Gi(f) [Why? read the definitions] 
@®,4 If fi, fe E I], and Gi(fi) <E Go(f2) then fi <p* fe 


[Why? as Gi(fi) <z Go(fe) there is B € E such that: j € B= 
Gi(fi)G) < Go(f2)(J) so for each 7 € B we have fi,<p, fa, (7,5) (by 
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the definition of Gi(f1)) and f2,¢7)(9) <p, fa (as Gilfi)(9) < Go(fe)(9) 
and the definition of Go(f2)(j)) so together fi <p, fz. So A(f1, fo) = {i < 
kK: fi(t) < fo(i)} satisfies: A(fi, fo) € D; for every 7 € B but B was chosen 
in E, hence A(fi, f2) € D* (by the definition of D*) hence fi <p» fo as 
required] 

Now first assume \ = tcf([] \, <p~) is well defined, so there is a sequence 
f = (fa : w < X) of members of [] A, <p+-increasing and cofinal. So (Go(fa) : 
a < A) is <p-increasing in [],., Hj (by @1), for every g € [],-, My for some 
f © TP) we have g <gz Go(f) (why? by 2), but by the choice of f for some 
B <2 we have f <p» fg hence by ®, we have g <p Go(f) <z Go(fa), so 
(Go(fa) : a < A) is cofinal in ([],<, Hj, <x). Also for every a < , applying 
the previous sentence to G(fa) + 1(€ []j,<, mj) we can find B < A such 
that G(fa) +1 <e G(fg), so G(fa) <z G(fa) so for some club C of A, 
(Go(fa) : @ € C) is <p-increasing cofinal in ([[,., Hj,<z). So if A is well 
defined then px = tcf([],-, Hj, <z) is well defined and equal to 2. 

Lastly assume that ju is well defined ie. [],., 1; /E has true cofinality pu, 
let 9 = (ga : @ < ys) exemplifies it. Choose by induction on a < p, a function 
fa and ordinals 8.,7%q such that 


(i) fa € J] X and Ba < wand ya < 
(ii) IB. <E Golfa) < eGi(fa) <z Ire (so Ba < Ya) 
(ili) a1 < a2 <=> Yar < Baz (80 Ba > a) 


In stage a, first choose Bg = Uf{ya; +1: 01 < a}, then choose fy € [] A such 
that gg, +1 <z Go(fa) (possible by 2) then choose 7¥q, such that Gi(fa) <z 
Gra. Now Go(fa) <p Gi(fa) by @3. By @4 we have ay < a2 > fa, <d« fas- 
Also if f € []\ then Gi(f) € [],<, uj hence by the choice of g, for some 
a < p we have Gi(f) <gz ga but a < Ba so Gi(f) <e ga <e& Go(fa) hence 
by @4, f <p~ fa. Altogether, (fa : a < mu) exemplifies that ([] A, <p~) has 
true cofinality jz, so is well defined and equal to wu. 11 


2.3. Conclusion 


If (*) of 1.5 holds, and o, fi = (yw; : j < o),(Dj : j < o) are as in 1.10 
and o + 6 < min(j), and J is an ideal on o and J an ideal on « such that 
I*CIC{ACk&: for some B € J for every j € 0 \ A we have B ¢ D;} (e.g. 
T= 1°) then pef ,({u3 : < o}) € pef (0). 


Proof. Let E’ be an ultrafilter on o disjoint to J then we can define an 
ultrafilter D* on « as in 1.10, so clearly D* is disjoint to J and we apply 
1.10. 111 
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3. Normality of X € pcf(A) for A 


Having found those ideals J-,[A], we would like to know more. As Jz [A] is 


increasing continuous in A, the question is how Jz[A], J < [A] are related. 


The simplest relation is Je, + [A] = J<a[A] + B for some B C k, and 


then we call X normal (for A) and denote B = By,[A] though it is unique 
only modulo Je,[A]. We give a sufficient condition for existence of such B, 
using this in 2.8; giving the necessary definition in 2.3 and needed information 
in 2.4, 2.5, 2.6; lastly 2.7 is the essential uniqueness of cofinal sequences in 
appropriate |] A/T. 


Definition 3. (1) We say € pef(X) is normal (for X) if for some B Cr, 
JeylA] = JenlA] + B. 5 
(2) We say A € pcf(A) is semi-normal (for X) if there are By fora < X such 


that: 


(i) a< B> By CBg mod Jza[A] and 
(ii) J<r[A] = Jer lA] + {Baia < A}. 


(3) We say » is normal if every X € pcf(A) is normal for X. Similarly for 
semi normal. 

(4) In (1), (2), (3) instead X we can say (A, I) or [A/T or ([]A, <r) if we 
replace I* by I (an ideal on Dom(X)). 


3.1. Fact 


Suppose (*) of 1.5 and 2 € pcf(A). Now: 
(1) A is semi-normal for X iff for some F = {fa : a < A} C [JA we have: 


la< B= fa < fe mod Je y{A]] and for every ultrafilter D over & disjoint 
to Je,|A], F is unbounded in ([[ A, <p) whenever tef([] A, <p) = A. 
(2) In 2.1(2), without loss of generality, we may assume that 


either: By = Bo mod Je,{A] (so A is normal) 
or: By # Bg mod J<,[A| for a < 8 < so 2 is not normal. 

(3) Assume ) is semi normal for X. Then \ is normal for iff for some F as 
in part (1) (of 2.2), F hasa <j, y)-exact upper bound g € J]; <,,(Ai + 1) 
and then B =: {i < «: g(t) = A;} generates J<y[A] over Je [AJ]. 

(4) If \ is semi normal for \ then for some f = (fa :a@ <2), B= (Bu: a< 
) we have: B is increasing modulo Je[A], J<, [A] = Jea[A] + {Ba : a < 
A}, and the sequences (fq : a < A) is <j, py increasing and f, B areas 
in 1.7. 


Proof. (1) For the direction =, given (Ba : a < A) as in Definition 3(2), for 
each a < A, by 1.8(1) we have ([]A | Ba, <j_,)3j) has true cofinality 
A, and let it be exemplified by (fg : 6 < A). By 1.5 we can choose by 
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induction on y < Aa function f, € [] A such that: 8,y <a> ff Sigial 
f, and B<y=> fe <7 fee 

Now F =: {fa : @ < A} is as required. [Why? First, obviously a < 
B => fa < fg mod Je [A]. Second, if D is an ultrafilter on « disjoint 
to I* and ([]A, <p) has true cofinality \, then by 1.6 for some B € 
Je, [A] \ Jex[A] we have B € D, so for some a < A,B C By mod Jey[A] 
hence Bo € D. As Sg Sei fg for B € [a, A) clearly F is cofinal in 
(IIA, <p).] 

The other direction, < follows from 1.7 applied to F = {fa :a< A}. 


[Why? we get there (Ba : a < A), Ba € J, [A] increasing modulo Je,[A] 
so J =: Jey[\] + {Bait a <A} C Jey fA). 

If equality does not hold then for some ultrafilter D over k, DN J = 0 
but DM Je,[A] 4 0 so by clause (D) of 1.7, F is bounded in [] A/D 
whereas by 1.8(1),(2), tef([] \, <p) = A contradicting the assumption 
on F’] 

(2) Because we can replace (By : a < X) by (Ba, : i < ) whenever (a; : 1 < 
X) is non decreasing, non eventually constant. 

(3) If A is normal for A, let B C « be such that J<,[A] = Jey[A] + B. By 
1.8(1) we know that ([](A | B), <j_,[x)) has true cofinality A, so let it 
be exemplified by (f2 : a < A). Let fo = fo U0(«\B) for a < A and let 
g € “Ord be defined by g(i) = A; if i € B and g(i) =0ifte«\ B. Now 
(fa 1a <A), g are as required by 1.3(11). 

Now suppose (fq : a < 4) is as in part (1) of 2.2 and g isa <j yy 
eub of F, g € []j2,Q4i + 1) and B = {i : g(i) = Ax}. Let D be an 
ultrafilter on « disjoint to Je,[A]. If B € D then for every f € J] A, let 
f' = (f | B) U0 \ B), now necessarily f’ < max{g,1} (as [i ¢ B= 
fii) < A; = g(@i)| and [fie «\ B= f'(i) =0 < g < IJ), hence (see 


Definition 2(4)) for some a < » we have f’ < max{fa,1} mod Jey[A] 
hence for some a < A, f’ < fa mod Jey{[A] hence f < f’ < fa mod D; 
alsoa <8 => fa < fg mod D, hence together (fa : a < X) exemplifies 
tef(T]\,<p) =A. If B¢Dthenk\BEDsog' = gl (k\B)U0p=g 
mod Danda<A=> fa <p fay <p g =p 9’, 80g' € II) exemplifies 
F is bounded in ([] \, <p) so as F is as in 2.2(1), tef([] A, <p) = 2 is 
impossible. As D is disjoint to Jey[A], necessarily tef([] \, <p) > A. The 
last two arguments together give, by 1.8(2) that J<, [A] = Je,[A]+ B as 
required in the definition of normality. 

(4) Should be clear. 22 


We shall give some sufficient conditions for normality. 


Remark. In the following definitions we slightly deviate from [8, Ch.I= 
[10]]. The ones here are perhaps somewhat artificial but enable us to deal 
also with case (3) of 1.5(«). Ie. in Definition 4 below we concentrate on the 
first 9 elements of an aq and for “obey” we also have A* = (Ay : a < 6) and 
we want to cover also the case @ is singular. 
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Definition 4. Let there be given regular X, 0 < uu < A, possibly an ordinal, 
SCX, sup(S) = » and for simplicity S is a set of limit ordinals or at least 
have no two successive members. 


(1) We call @ = (a, : a < X) a continuity condition for (S, 4,0) (or is an 
(S, 1, 0)-continuity condition) if: S is an unbounded subset of ,da C 
a, otp(aq) < p, and [B € dg > ag = ag Nf] and, for every club E 
of , for somet 6 € S we have 0 = otp{a € as : otp(ag) < @ and for 
no 8B € asNa is (8,a)N E = 0}. We say @ is continuous in S* if 
a é€ S* >a=sup(dq). 

(2) Assume fa € “Ord for a < and A* = (A* : a < 0) be a decreasing 
sequence of subsets of & such that K\ A* € I*. We say f =(fa:a<%X) 
obeys @ = (aq: a < X) for A* if: 


(i) For B € aa, if € =: otp(aa) < @ then we have fg | AZ < fa | AF 
(note: A* determine 0). 


(2A) Let k,r,I* be as usual. We say f obeys a for A* continuously on S* if: 
a is continuous in S* and f obeys @ for A* and in addition S* C S and 
for a€ S* (a limit ordinal) we have fa = fa, from (2B), i.e. for every 
i<« we have fa(t) = sup{fa(t) : 8 € aa} when |aa| < Ai. 

(2B) For given \ = (i: i < &),f = (fo 1 @ < 2») where fa € [A and 
a © X, and 6 let fa € [JA be defined by: fa(i) is 0 if \al > 4 and 
UL fa(t) : a € a} if jal <r. 

(3) Let (S,@) stands for (S,8+1,0); (A, 4,0) stands for “(S, 4,0) for some 
unbounded subset S of X” and (A, 0) stands for (A, 0 + 1,0). 
If each A* is Kk, then we omit “for A*” (but 6 should be fixed or 
said). 
(4) We add to “continuity condition” (in part (1)) the adjective “weak” [‘6- 
weak”) if “B € dg > ag = 4. 8B” is replaced by “a € S&B E dg > 

“hy <a)[aqB C ay8y < min(aa\ (B-+1))&[laa6| <9 = lay] <4)” 

[but we demand that y exists only if otp(da.M 8) < 6]. (Of course a 

continuity condition is a weak continuity condition which is a 6-weak 

continuity condition.) 


“—~ 


Remark 2.8A. There are some obvious monotonicity implications, we state 
below only 2.4(3). 


3.2. Fact 


6 cf(0)=0 
6+ cf(0) <0 
stationary S C {6 < A: cf(d) = 6,}, there is a continuity condition a for 


(1) Let 0, = { and assume A = cf(A) > 6+. Then for some 


¥Note: if otp(as) = @ and 6 = sup(as) (holds if 6 € S, pp = 0+1 and @ continuous 
in S (see below)) and 6 € acc(£) then 6 is as required. 
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(S,6,); moreover, it is continuous in S and 6 € S => otp(as) = 6,; so for 
every club E of X for some 6 € S,Va,Bla< Bkae€as& BEas > 
(a, 8) E #9). 

(2) Assume \ = 0+, then for some stationary 9 C {6 < \: cf(d) = cf(6)} 
there is a continuity condition for (S,9+ 1,0). (In fact continuous in S$ 
and [6 € S => a; closed in 6] and [a € as and 6 € S > dg = a5 Nal.) 

(3) If @ is a (A, 1, 61 )-continuity condition and 6, > 6 then there is a (A, 0 + 
1, 0)-continuity condition. 


Proof. (1) By [13, §1]. 


(2) By [$h351, 4.4(2)] and$ [8, III 2.14(2), clause (c), pp. 135~7]. 
(3) Check. 2.4 


Remark 2.4A. Of course also if 4 = 6* the conclusion of 2.4(2) may well 
hold. We suspect but do not know that the negation is consistent with ZFC. 


3.3. Fact 


Suppose (*) of 1.5, fa <€ [JA for a < \,A = cf(A) > @ (of course K = 
Dom(A)) and A* = A*[)] is as in the proof of 1.5, ie., AX = {i << K: 4 > a}. 
Then 


(1) Assume @ is a 6-weak continuity condition for (S,@),\ = sup(S), then 
we can find f’ = (f! :a< A) such that: 
(i) f2 € TDA, 
(ii) For a < A we have fa < fi 
(iii) For a < B<x we have fa <j ny fp 
(iv) f’ obeys a for A* 


(2) If in addition min(\) > py, S* C S are stationary subsets of \ but @ is a 
continuity condition for (S,,@) and @ is continuous on S* then we can 
find f’ = (fi :a@< A) such that 


(i) f2 <TD) 

(ii) Fora € X\ S* we have f. < fi anda =6B+1EA\S* & BeS*s 
fa < fa 

(iii) For a < 8 < A we have la = gol de 


(iv) f’ obeys @ for A* continuously on S*; moreover 2.3(2)(i) can be 
strengthened to 8 € ag > fe < fa. 


‘the definition of Bf in the proof of [8, III 2.14(2)] should be changed as in 
[Sh351, 4.4(2)] 
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(3) Suppose (f% : @ < A) obeys G continuously on S* and satisfies 2.5(2)(ii) 
(and 2.5(2)’s assumption holds). If ga € [TJ] and (ga : a < A) obeys a@ 
continuously on S* and [a € A\ S* > ga < fo] then A, ga < fi. 

(4) If ¢ < 0, for e < ¢ we have f* = (f& : a < 2), where f© € []A, then in 
2.5(1) (and 2.5(2)) we can find f’ as there for all f° simultaneously. Only 
in clause (ii) we replace fa < fQ by fa | At < f& | AG (and fe < fa by 
fol ALS fit AB). 


Proof. Easy (using 1.5 of course). 

CLAIM 2.5A: In 2.5 we can replace “(*) from 1.5” by “T]A/Je,[A] is 
-directed and liminf;-(X) > 6”. 

CLAIM 2.6: Assume (*) of 1.5 and let A* be as there, 


(1) In 1.7, if (fa : @ < A) obeys some (S,)-continuity condition or just a 
6-weak one for A* (where S C . is unbounded) then we can deduce also: 


(G) the sequence (Ba/J<[A] : a < A) is eventually constant. 


(2) If @* <A then J<a[A]/J<a[A] is \t-directed (hence if \ is semi normal 
for A then it is normal to A). 


Proof. (1) Assume not, so for some club E of \ we have 

(x)a <b <A&dE€ E> By # Bs mod Jex(Al. 

As @ is a 0-weak (, @)-continuity condition, there is 6 € S such that 
b =: {a € as : otp(as Na) < 6 and for no 8 €asNais (B,a)NE=0 
and for some y < a,@aM 6 C ay and y < min(a, \ (8 + 1)) and 
|a,| < 0} has order type 6. Let {az : ¢ < 9} list b (increasing with 
€). So for every € < @ there is yz € (@2,@e41) MN E, and let Be < ae41 
be such that as Naz C ag. and otp(ag, Na.) < 0; by shrinking 6 and 
renaming wlog 6: < y and az € ag,. Let €(€) =: otp(ag,). Lastly let 
Boat <a fa ti < fae) <inG) < fos}, clearly Wis « 
mod J* and let (remember (*) above) BE =: Af), (By. \ Bs.) BE, 


now By, C Ba, © By, mod J<,[A] by clause (B) of 1.7, and B,, # Bag, 


mod J<,[A] by (*) above hence By, \ Bg. #4 0 mod Je,[A]. Now 
Be, Ag.)41 = * mod I* by the previous sentence and by 1.5(*) which 


we are assuming respectively and I* C Je [A] by the later’s definition; 
so we have gotten B* 4 § mod Jey[A]. But for ¢ < ¢ < @ we have 
Bz 0 Be = 0, for suppose i € BIN Be, soi € Ag(e)41 and also 
fre) < faci @® < fe-(t) (as i € B? and as acy1 € ag, KI Atos 
respectively); now i € B® hence i € By, i.e., (where g is from 1.7 clause 
(D)*)f,.(2) > g(z) hence (by the above) fg.(i) > g(i) hence i € Bg, 
hence i ¢ Bz, contradiction. So (Bi : € < 0) is a sequence of 6 pairwise 
disjoint members of (Jz)[A])*, contradiction. 

(2) The proof is similar to the proof of 1.8(5), using 2.6(1) instead 1.7 (and 
a from 2.4(1) if \ > 0+ or 2.4(2) if X= 6++). ~ 


We note also (but shall not use): 
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CLAIM 2.7: Suppose (*) of 1.5 and 


(a) fo € IIA for a < 4,A € pef(A) and f = (fg : a < d) is a GE 
increasing 

(b) f obeys @ continuously on S*, where @ is a continuity condition for (S, 0) 
and A = sup(S) (hence A > @ by the last phrase of 2.3(1)) 

(c) J is an ideal on « extending Je)[A], and (fa/J : a < A) is cofinal in 
(IIA, <s) (e.g. J = JealA] + («\ B), BE Jeala] \ JealAl)- 

(d) (fi :a@ <4) satisfies (a), (b) above. 

(e) fa < fi for a € X\ S* (alternatively: (f/ : a < 2) satisfies (c)). 

(f) If 6 € S’ then J is cf(d)-indecomposable (i.e. if (Az : € < cf(d)) isa 
C-increasing sequence of members, of J then U.<cr(5) Ae € J). 


Then: 
(A) The set 
{6 <A: if 6 € S* and otp(as) = 6 then fs = fs mod J} 


e<cf 


contains a club of A. 
(B) The set 


{6 <A:ifa€S and 6 =sup(dMa,) and otp(aNas) = 6 
then fona; = fares mod J} 


contains a club of A. 


Proof. We concentrate on proving (A). Suppose 6 € S*, and fs ¢ f; mod J. 
Let 


Ais = {i< a: fa(i) < f5()} 
Ags = {i<n: felt) > fs}, 


So Aj.5U Ags € J*, suppose first A1,5 € J+. By Definition 4(2A), for every 
i € Aj 5 for every large enough a € az, fs(i) < fi (2), say for a € ag \ Bj. As 
J is cf(6)-indecomposable for some 8 < a we have {i < «: Bi < By € JT 
so fs | Ais < fp | Ais (and 6 < 6). Now by clause (c), FE =: {6 < \: 
for every 8 < 6 we have f% < fs mod J} is a club of A, and so we have 
proved 


6€k=> A165 E J. 
If Age fa < fh (first possibility in clause (e) implies it) also Ags € J 


hence for no 6 € S*M E do we have fs # fs mod J. If the second possibility 
of clause (e) holds, we can interchange f, f’ hence [6 € H => Ags € J] and 


we are done. 2.7 


We now return to investigating the J-,[A], first without using continuity 
conditions. 
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Lemma 5. Suppose (*) of 1.5 and X = cf(A) € pef(A). Then X» is semi 
normal for X. 


Proof. We assume » is not semi normal for \ and eventually get a 
contradiction. Note that by our assumption ([] \, <r) is 0*-directed hence 
minpef;(A) > 6+ (by 1.3(4)(v)) hence let us define by induction on € < 
0, f§ = (f§ a <A), Be and De such that: 


Q) @ f§ eT] 
(ii) a<B<A=> ff < fj mod Je(A] 
(ili) a < ARE <0 => f& < f? mod Jey] 
(iv) For¢<€<Oanda<r: fETAr<f§lA 
(II) (i) De is an ultrafilter on « such that: cf([] V/De) = A 
(ii) (f§/De : @ < A) is not cofinal in J] A/ De 
(iii) (f§+!/De : a < X) is increasing and cofinal in [] A/ De; moreover 
(iii)* Be € De and (f§t!/De : a < 4) is increasing and cofinal in 
TLA/(J<ala] + (« \ Be)) 
(iv) f§*"/De is above {f§/De : a < A}. 


For € = 0. No problem. [Use 1.8(1) + (4)]. 

For € limit < 0. Let g§ € []\ be defined by g§(i) = sup{ f$(i) : ¢ < &} 
for i € Az and f§(i) = 0 else, (remember that « \ Af € I*). Then choose 
by induction on a < A, f§ € J] X such that g§ < f§ and 8B <a> fg < fo 
mod Jz, [A]. This is possible by 1.5 and clearly the requirements (I) (i),(ii),(iv) 
are satisfied. Use 2.2(1) to find an appropriate De (i.e. satisfying II (i)+(ii)). 
Now (f§ : @ < A) and Deg are as required. (The other clauses are irrelevant.) 

For € = @. Choose f% by induction of a satisfying I(i), (ii),(iii) (possible 
by 1.5). 

For € = C+1. Use 1.6 to choose Be € DeNJ<y[A]\ Jen [A]. Let (g§ : aw < 2) 
be cofinal in ([] \,<p,) and even in ([] A, SF 1+ («c\ Be) and without loss 


of generality \,-) f$/De < g§/De and Nacr f& | AE < 9§ | Az. We get 
(f§ : @ < A) increasing and cofinal mod (Je ,[A] + (K \ Be)) such that 
g&, < f& by 1.5 from (g§ : a < ). Then get Dg as in the case “€ limit”. 

So we have defined the f§’s and D¢’s. Now for each € < @ we apply (II) 
(iii)* for (f§t1 sa <A), (f28 :a< A). We get a club C¢ of \ such that: 


a<BECe= fot Be< fg | Be mod Je] () 
So C =: (ecg Ce is a club of . By 2.2(1 - applied to (f2 : a < 2) (and the 
assumption “A is not semi-normal for \”) there is g € [[ \ such that 
ag < f? mod Je,{A] for a <2 («)1 
(not used) and by 1.5 wlog 
fe <g mod Je,[A] forE<@ — (*)2 
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For each € < 6, by II (iii), (iii)? for some ag < A we have 
gt Be<feh' | Be mod Jef] (*)s 


Let a(*) = supecg Ae, 80 a(*) < A and so 


gt Be < fat} | Be mod Jef] (*)a 
For ¢ < 0 let Bt = {i € At: g(t) < fo.)()}- By (Ja, BE € De; by 
(II) (iv) +(*)2 we know Bz € De, hence Bz # Bz,, mod Dg hence Be # 
Bey mod Je A]. 

On the other hand by (I)(iv) for each ¢ < 6 we have (BEM Az: € < ¢) is 


C-increasing and (as A? = % mod Je,{A] for each ¢ < 0) hence by I(iv) we 
have (Bz/I* : € < 6) is C-increasing, and by the previous sentence B? # Bz, , 
mod J<,[A] hence (Bz/I* : € < 6) is strictly C-increasing. Together clearly 
(Be, 1 Az,, \ Be : € < ) is a sequence of 6 pairwise disjoint members of 


(Je,[A])*, hence of (I*)*; contradiction to 9 > wsat(I*). 2.8 


Definition 5. (1) We say (By : \ € ¢) is a generating sequence for 
(i) By Cr and c C pef (A) 


(ii) J<y[A] = JerlA] + By for each X€ ¢ 
(2) We call B= (By: €c) smooth if: 
7€ By&X\; €c=> By, CB). 
(3) We call B = (By : \ € Rang(A)) closed if for each 


By 2D {i<K: A € pcf(A [ By)} 


3.4. Fact 


Assume (*) of 1.5. 


(1) Suppose ¢ C pef(A), B = (By : \ € ¢) is a generating sequence for A, 
and B C k, pef(A | B) C ¢ then for some finite 0 C ¢,B C Uneo Bu 
mod J”. - 

(2) cf({]X/I*) = max pef(X) 


Remark 2.10A. For another proof of 2.10(2) see 2.12(2) + 2.12(4) and for 
another use of the proof of 2.10(2) see 2.14(1). 


Proof. (1) If not, then J = I* + {BN Uc, Bu : 0 € ¢,0 finite} is a family 
of subsets of «, closed under union, B ¢ J, hence there is an ultrafilter 
D on k. disjoint from I to which B belongs. Let uw =: cf([],.,, Ai/D); 


necessarily ys € pef(A | B), hence by the last assumption of 2.10(1) we 
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have py € c. By 1.8(2) we know B,, € D hence BNB,, € D, contradicting 
the choice of D. 

(2) The case 9 = No is trivial (as wsat(I*) < No implies P(k)/I* is a 
Boolean algebra satisfying the No-c.c. (as here we can subtract) hence 
this Boolean algebra is finite hence also pef() is finite) so we assume 
6 > No. For B € (I*)* let \(B) = max pef wre (A | B). 

We prove by induction on \ that for every B € (I*)t, cf(T] A, <r+4(«\B)) 
= (B) when \(B) < A; this will suffice (use B = & and A = |J]J,~,, Ai|*) 
Given B let \ = \(B), by notational change wlog B = k. By 1.9, pef([] A) 
has a last element, necessarily it is A =: A(B). Let (fa : a < 2A) be 
<j_,[y) increasing cofinal in |] A/Jey{Al, it clearly exemplifies max pcf(\) < 
ef([] \/I*). Let us prove the other inequality. For A € Jz [A] \ I* choose 
F4 C JJ) which is cofinal in [] A/(I* + (« \ A)), |Fa] = \(A) < A (exists 
by the induction hypothesis). Let y be a large enough regular, and we now 
choose by induction on € < 0, Nz, g- such that: 


(A) (i) Ne <(H Me <x) 
(i) ||Nell = 

(iii) (Ne : ze €) € Neqi 

(iv) (Ne : € < 6) is increasing continuous 

(v) jeep A+ 1} C No, A, I*} © No, (fa : w < A) € No and the 

function A> F'4 belongs to No. 

(B) (i) ge € []\ and ge € Ne+1 
(ii) For no f € NzN[]A does gz <r f 
(ili) ¢ < e&Xi > |e| => ge(t) < ge(?). 


There is no problem to define N., and if we cannot choose g- this means 
that N-N[] \ exemplifies cf([] \, <) < A as required. So assume (Nz, ge : € 
9) is defined. For each ¢ < @ for some a(e) < A, ge < fae) mod J<,[X] hence 
ale) Sa<A=> ge <j xp) fa- As A = cf(A) > 8, we can choose a < A such 
that a > U.cg a(e). Let Bs = {i < kK: ge(t) = fa(t)}; so for each € < @ we 
have (B-.A¢ : € < &) is increasing with e, (by clause (B)(iii)), hence as usual 
as 6 > wsat(I*) (and 6 > No) we can find e(*) < @ such that A, Bes)in = 
B..) mod I* [why do we not demand ¢ € (€(*),@) > Be = Bz.) mod I*? 
as # may be singular]. Now as ge.) € Necx)41 and fo € No < Ne(«)41 clearly, 


by its definition, Bz.) € Ne(s)41 hence Fp...) € Ne(«)41 Now: 


Ge(x)t 1 (&\ Becay) =r* Ge(wy4 | (CK \ Bewyti) < fa f (6 \ Becay4s) 
=r fa | (« \ Bex) 
[why first equality and last equality? as B.(.)41 = Bei.) mod J*, why the < 
in the middle? by the definition of Bz;,)+1]. 
But gew+1 | Bea) € Tien... 0 and Bewy € JealA] as ge < fate) S fa 


mod Je [A] so for some f € Pas S C I]A we ies Ge(x)41 | Bera) < fb Bee) 
mod I*. By the last two sentences 
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Je(*)+1 = max{ f, ‘Fat mod I* («) 
Now fo € New)41 and f € Newy4i (as f € Bens ol EB | ee ae 
Ne(x)-+1 the function B+ Fg belongs to No < Neopet and Bea) € Neca) 41 
as {Ge(x), fa} € Ne(«)41) So together 
max{f, fa} € Ne(«)413 («*) 


But (*), (**) together contradict the choice of g.(.)41 (i.e. clause (B)(ii)). 


2.10 


Definition 6. (1) We say that I* satisfies the pcf-th for (the regular) 
(A, 0) if [[A/I* is 6-directed and ([] A, <j ,[y)) #8 A-directed for each 
d and we can find (By : X € pef ;«(A)), such that: 
BACK, Jey, I] =P 4+{ By: € ANpet» (A)}, By ¢ Teal, I” 
and TJ(A | By)/JealA, I*] has true cofinality (so By € Je<[A] \ 
Jex{A] and J<y[A] = Jea[A] + By). - 7 
(1A) We say that I* satisfies the weak pcf-th for (A,@) if (TIA, <r) és 
0-directed 
each ([] A, <j.[y)) #8 A-directed and 


there are By 4 Ck fora <X€ pefy. (A) such that 
a<B<pepefp(A) > Bua C Bug mod Je,[A, I*] 


Je [A] =1* + {Bua i a<p<A,p€ pefy-(A)} 
and 
[a | Bua), <7) has true cofinality 


(1B) We say that I* satisfies the weaker pcf-th for (A, 0) if TI] r, <r) is 0- 
directed and each (TJ), <j eypxy), is A-directed and for any ultrafilter 
D onk disjoint to J<o[A| letting \ = tef(T] A, <p) we have: > 6 and 
for some B € DJ [A]\J<alAl, the partial order (T](A | B), <3.) 
has true cofinality X. 7 _ 

(1C) We say that I* satisfies the weakest pcf-th for (A, 0) if ([] A, <r) és 
6-directed and (TJ), <jey[y) 18 A-directed for any » = 0 

(1D) Above we write X instead (A, 0) when we mean 


6 =sup{0: ({[ A, <r) is 0* — directed}. 


(2) We say that I* satisfies the pcf-th for 0 if for any regular such that 
liminf;-() > 6, we have: I* satisfies the pcf-th for X. We say that 
I* satisfies the pcf-th above 1 (above py ) if it satisfies the pcf-th for 
r with liminf;-(A) > pw (with {4:4 > uw} = K mod I*). Similarly 
(in both cases) for the weak pcf-th and the weaker pcf-th. 
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(8) Given I*,0 let I ={ACK:AELT or AG I* andI* +(k\ 
A) satisfies the pcf-theorem for 0}. 


Jp*t =: {A C «: wsat(I* | A) <0 or AE I*}; 


similarly ae we may write, Jj [I*]. 

(4) We say that I* satisfies the pseudo pcf-th for d if for every ideal I on 
k extending I*, for some A € I+ we have (T](A | A), <r) has a true 
cofinality. 


CLAIM 2.12: 


(1) If («) of 1.5 then J* satisfies the weak pcf-th for (A, 0*). 

(2) If («) of 1.5 holds, and [] A/J* is 0*++-directed (i.e. 0+ < min X) or just 
there is a continuity condition for (67,6) then I* satisfies the pcf-th for 
(A, 07). 

(3) If I* satisfy the pcf-th for (A, ) then I* satisfy the weak pcf-th for (A, 0) 
which implies that I* satisfies the weaker pcf-th for (\, 9), which implies 
that I* satisfies the weakest pcf-th for (A, 0). 


Proof. (1) Let appropriate \ be given. By 1.5, 1.8 most demands holds, 
but we are left with seminormality. By 2.8, if 4 € pcf(A), then 2 is 
semi normal for 4. This finishing the proof of (1). 
(2) Let A € pef(A) and let f, B be as in 2.2(4). By 2.4(1) + (2) there is @, 
a (A, 6)-continuity condition; by 2.5(1) wlog f obeys a, by 2.6(1) the 
relevant B,/I* are eventually constant which suffices by 2.2(2). 
(3) Should be clear. 2.12 


CLAIM 2.13: Assume ([] A, <r«) is given (but possibly (*) of 1.5 fails). 


(1) If I*,A satisfies (the conclusion of) 1.6, then I*,\ satisfy (the 
conclusions of) 1.8(1), 1.8(2), 1.8(3), 1.8(4), 1.9. 
(1A) If I* satisfies the weaker pcf-th for then they satisfy the conclusions 
of 1.6 and 1.5. 
(2) If I*,X satisfies (the conclusion of) 1.5 then I*,. satisfies (the 
conclusion of) 1.10. 
(2A) If I* satisfies the weakest pef-th for A then I*, \ satisfy the conclusion 
of 1.5. 
(3) If I*,X satisfies 1.5, 1.6 then I*,X satisfies 2.2(1) (for 2.2(2) — no 
assumptions). 
(4) If I*,A satisfies 1.8(1), 1.8(2) then I*,. satisfies 2.2(3) when we 
interpret “seminormal” by the second phrase of 2.2(1) 
(5) If I*,X satisfies 1.8(2) then I*, 2 satisfies 2.10(1). 
(6) If I*\ satisfy 1.8(1) + 1.8(3)(i) then I*, satisfies 1.8(2) 
(7) If I*, \ satisfies 1.8(1) + 1.8(2) and is semi normal then 2.10(2) holds 
ie. 


ef(] [ \, <r+) < sup pef;, (A). 
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(8) If I*, \ satisfies 1.5 + 1.6 then they satisfy 2.10(2). 
Proof. (1) We prove by parts. 


Proof of 1.8(2). Let A = tef([ A/D); by the definition of Je,{A], clearly 
DO Jex{A] = 0. Also by 1.6 for some B € D we have A = tef([](A | 
B), A 7s a) so by the previous sentence B ¢ Jey[A], and by 1.4(5) we have 


B € Je [A], together we finish. 
Proof of 1.8(1). Rep eat the proof of 1.8(1) replacing the use of 1.5 by 1.8(2). 


Proof of 1.8(8)(i). Let J =: Ue, J<ulAl, so J © J<,[A] is an ideal because 
(Jey[A] :  < A) is C-increasing (by 1.4(2)), if equality fail choose B € 
Je x{A]\ J and choose D an eae on &, disjoint to J to which B belongs. 
Now if w = cf(u) < A then pt < X (as A is a limit cardinal) and p = 
cf(u) & wt <A> DN Jgy[A] = DO Jey+[A] = 0 hence by 1.8(2) we have 
uw # cf([[ A/D). Also if w = cf(u) > A then DN Je, [A] C DA Je[\] 4 0 
hence by 1.8(2) we have y 4 cf([] \/D). Together contradict ion by 1.3(7). 


Proof of 1.8(3)(ii). Follows. 
Proof of 1.8(4). Follows. 


Proof of 1.9. As in 1.9. 


(1A) Check. 
(2) Read the proof of 1.10. 
(2A) Check. 
(3) The direction => is proved directly as in the proof of 2.2(1) (where 
the use of 1.8(1) is justified by 2.13(1)). 


So let us deal with the direction <. So assume f = (fa : a < A) isa 
sequence of members of [] \ which is < Je,{d/-mereasing such that for every 
ultrafilter D on « disjoint to Je,[A] we have: \ = tef([]A\,< D) iff f is 
unbounded (equivalently cofinal) in ([] \,<p). By (the conclusion of) 1.5 
wlog f is < Je {aj increasing. 

By 1.5 there is g € [JA such that fy < g mod J<,{A] for each a < A, 
and let By =: {i < «: g(t) < fa(i)}. Hence By € J<,[A] (by the previous 
sentence) and (By/J<,[A] : w < 2) is C-increasing (as (fa : a < A) is < Seis 
increasing). Lastly if B € J< [A], but B\ Ba ¢ Je [A] for each a < 4, let D be 
an ultrafilter on x, disjoint to J<,[A] + {Ba : a < A} but to which B belongs, 
so tef([] \, <p) = (by 1.8(2) which holds by 2.13(1)) but {f4/D:a< A} 
is bounded by g/D (as fa/D < g/D by the definition of B,.), contradiction. 
So the sequence (By : a < A) is as required. 


(4)-(6) Left to the reader. 
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(7) Let for \ € pef(A), (BA : i < A) be such that Je [A] = Jea[A] + 
{BA :i < X} (exists by seminormality; we use only this equality). 
Let (f2" : a < A) be cofinal in (T](A | B}),<j.p5)), it exists by 


1.8(1). Let F be the closure of {f**: a < \,i < A,X € pef(A)}, 
under the operation max{g,h}. Clearly |F| < suppcf(A), so it 
suffice to prove that F’ is a cover of ([[ A, <r). Let g € J]A, if 


Af € F)(g < f) we are done, if not 
T={AU{i<k: f@>g@}: fekR Acer} 


is No-directed, « ¢ I, so there is an ultrafilter D on « disjoint to 
I, (so f€ F> f <pg) and let \ = tef([] A/D), so by 1.8(2) we 
have DN J<y[A] \ Jey[A] 4 0, hence for some i < A, BA € D, and 
we get contradiction to the choice of the {f*** : a < A}(C F). 

(8) Repeat the proof of 2.10(2) (only using J = {A Cr: if A ¢ 
Jey{A] then cf([] A/I*) < A}; if k ¢ J let D be an ultrafilter on « 
disjoint to J, and use 1.6). 214 


—— 


CLAIM 2.14: If I* satisfies pseudo pcf-th then 


(1) We can find ((J¢,9¢) : ¢ < ¢*), ¢* a successor ordinal such that Jo = 
I*, Jeu. ={ACxw: if AZ Je then tef([](A | A), <3.) = Oc} and for 
no A € (J¢)* does ([](A | A), <y,) has true cofinality which is < ¢. 

(2) If I* satisfies the weaker pcf-th for \ then I* satisfies the pseudo pcf-th 
for A. 


Proof. (1) Check (we can also present those ideals in other ways). 
(2) Check. 2.14 


4. Reduced Products of Cardinals 


We characterize here the cardinalities [],—,, Ai/D and Tp((A; : i < «)) using 
pef’s and the amount of regularity of D (in 3.1-3.4). Later we give sufficient 
conditions for the existence of < p-eub or < p-eub. Remember the old result of 
Kanamori [3] and Ketonen [5]: for D an ultrafilter the sequence (a/D : a < k) 
(i.e. the constant functions) has a <p-eub if reg(D) < «; and see [8, II 3.3] 
(for filters). Then we turn to depth of ultraproducts of Boolean algebras. 

The questions we would like to answer are (restricting ourselves to “A; > 
2"” or “\; > 2?” and D an ultrafilter on « will be good enough). 

QUESTION A: What can be Carp =: {]],-,, 4i/D : 4 a cardinal for i < «} 
i.e. characterize it by properties of D; (or at least Carp \ 2“) (for D a filter 
also Tp([[ A;) is natural). 

QuEsTION B: What can be DEPTH} = {Depth ([],_, 
regular cardinal} (at least DEPTH} \ 2", see Definition 10). 


di/D) 4 Aja 


The PCF Theorem Revisited 469 


If D is an Ny-complete ultrafilter, the answer is clear. For D a regular 
ultrafilter on K,A; > No the answer to question A is known [1] in fact it 
was the reason for defining “regularity of filters” (for A; < No see [9, Sh-a, 
VI §3 Th.3.12 and pp. 357-370] better [Sh-c, VI §3] and Koppleberg [4].) 
For D a regular ultrafilter on «, the answer to the question is essentially 
completed in 3.22(1), the remaining problem can be answered by pp (see [8]) 
except the restriction (V, < A)(|a|8° < A), which can be removed if the 
cov = pp problem is completed (see [8], [AG]). So the problem is for the 
other ultrafilters D, on which we give a reasonable amount on information 
translating to a pcef problem, some times depending on the pcf theorem. 


Definition 7. (1) For a filter D let reg(D) = min{@: D is not 0—regular} 
(see below). 
(2) A filter D is 0-regular if there are Az € D fore < 6 such that the 
intersection of any infinitely many A_-s’ is empty. 
(3) For a filter D let 


reg,(D) = min{@: there are no Az € D* fore < @ such that 
noi<« belongs to infinitely many A's} 
and 
reg(D) =:{0: there are no A. € Dt fore <6 such that : 
e<¢=>A¢CA- mod D and noi<k 
belongs to infinitely many A,'s}. 


(4) reg’(D) = min{@: D is not (0,0)—regular} where “D is (0,0)-regular” 
means that there are Az € D for a < 0 such that the intersection of 
any o of them is empty. Lastly reg? (D), reg3(D) are defined similarly 
using A- € D*. Of course reg(I) etc. means reg(D) where D is the dual 
filter. 


Definition 8. (1) Let 
tcf p,,.(] | i) = sup{tef(]] Ni/D):u<ri= cfr; <% fori <K and 
tef(| | A; /D) is well defined} and 


hefpw(T]_, vi) = sup{ef(]] Ni /D):w< ri = cfAi < yi}; 


if 6 = Xo we may omit it. 
(2) For E a family of filters on « let htcfey([],e, < i) be 


sup{tef(]] Ni/D): DEE and p< Ay = chi < a; fori < « and 


vef( TT \;/D) is well defined}. 
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Similarly for hefz,, (using cf instead tcf). 

(3) hefp CT]. 0%) is hefe,,(T],.,0i) for E = {D' : D'a filter on 
extend ing D}. Similarly for htcfp ,,. 

(4) When we write I e.g. in hefr,, we mean hefp,,, where D is the dual 
filter. 


CLAIM 3.3: 


(1) reg(D) is always regular 
(2) If @< reg,(D) then some filter extending D is 6-regular. 
(3) wsat(D) < reg, (D) 
(4) reg(D) < egg (D) < reg, (D) 
(5) reg,(D) = min{@: no ultrafilter D; on « extending D is 6 — regular} 
(6) If D C E are filters on « then: 
(a) reg(D) < reg(E) 
(b) reg, (D) > reg, (E) 


Proof. Should be clear. E.g. (2) let (ue : € < 6) list the finite subsets of 0, 
and let {A- : « < 6} C Dt exemplify “6 < reg,(D)”. Now let D* =: {AC k: 
for some finite wu C @, for every ¢ < 6 we have:u C ue > Ae C A mod D}, 
and let At = U{Ac: € € uc}. Now D* is a filter on « extending D and for 
Ee <6 we or Az ¢ D. Finally the intersection of AZ, M Az, M... for distinct 
En < 6 is empty, because for any member j of it ae can ‘find Gn < 0 such 
that 7 € Ac, and én € uc,. Now if {¢, : n < w} is infinite then there is no 
such 7 by the choice of (Az : « < 6), and if {¢, : n < w} is finite then wlog 
Anw $n = 60 contradicting “ug, is finite” as A, 2, En € uc,- Lastly 0 ¢ D* 
because A? £4 mod D. 3.3 


4.1. Observation 


[Tien Xé/Z| > |N§/T| holds when A, —,, Ai > Xo. 


4.2. Observation 


(1) [acu /TI > bteff(T,c, 3). - 
(2) If I* satisfies the pcf-th for \ or even the weaker pcfth for (see 


Definition 6) then: cf([] A/I*) = max pef ;. (A). 
(3) If I* satisfies the pef-th for y for and min(X) > py then 


hefpu([ [ A) = hef5 ,,(] [ 4) = htef’p,,.(] J) 
whenever D is disjoint to I*. 
(4) hef pu (Lice ri) = hefp (Tex ri). P : 
(5) Tee ru/I > hefr yu ( Lic. ri) = ety (sz, ri) = htefr Tex Ai) 
and heft (ex Xi) = hief (lle, Xi): 
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Remark 3.5A. In 3.5(3) concerning htcfp,,, see 3.10. 


Proof. (1) By the definition of htcf7 it suffices to show |]];—,, Ai/| = 
tef([] 4/2’), when I’ is an ideal on « extending I, Xi = cfr < A; 
for i < « and tcf([J,-,, 4/1’) is well defined. Now |[J,_,, As/I| = 
Whee M/1 = We. N/M! = fe, 4/1’), 80 we have finished. 

(2) By 2.13(1A)clearly I*, satisfies 1.5, 1.6 hence by 2.13(1), (2) 
also 1.8(1), (2), (3), (4) and 1.9 and 1.10. Now by 2.13(8) also (the 
conclusion of) 2.10(2) holds which is what we need. 

(3) Left to the reader (see Definition 6(2) and part (2)). 

(4), (5) Check. 3.5 


CLAIM 3.6: If A= |[],<,, As/Z| (and A; > No and, of course, J an ideal on 
«) and @ < reg(I) then \ = X?. 


Proof. For each i < k, let (ni, :a < ri) list the finite sequences from ,;. Let 

Mi = (Ai, Fi, Gi) where Fi(a) = l(a); Gila, 8) is m(8) if B < Lg(na)(= 

F;(a)), and F(a, 8) = 0 otherwise; let M = [J,_,, Mi/I so ||M|| = |[T]«/1| 

and let M = ([] \i/I, F,G). Let (A; : i < 6) exemplifies I is 0-regular. Now 

(*)1 We can find fe “wand f- € [],2,, f(i) for e < 0 such that: e << ¢ < 
6 => fe <r fe [just for i < K let wi = {e < 6:71 € AG}, it is finite and let 
f(t) = |wi|+1 and f.(2) = |eNw;| < f(a), and note e < (&i € A-N Ac => 
fet) < fe@]. 

(*)2 For every sequence g = (g- : € < 0) of members of [J;_,, Xi, there is 
he [],2,A% such thate <0 > MF F(h/I, fe/I) = ge/I [why? let, 
in the notation of (*):,h(7) be such that 7,(;) = (ge(t) : € € wi) (in the 
natural order)]. 


So in M, every 6-sequence of members is coded using f/I, f-/I (for « < @) 
by at least one member so ||M||° = ||M||, but ||1/|| = | [],;<,,A:/Z| hence we 
have proved 3.6. 3.6 


4.3. Fact 


(1) For D a filter on «, (A;, Az) a partition of « and (non zero) cardinals 
A; for i < K we have 


TT. /P1= NTT, 0/8 + 40 x TT / (2 + 40)| 


(note: | [],2,,%/P(«)| = 1). 

(2) Del =: {A Caw: |[],2, Ai/(D + (« \ A))| < pu} is a filter on « (pw an 
infinite cardinal of course) and if No < pw < [],2,,Xi/D then DH! is a 
proper filter. 

(3) TfA < |[],2,,A«/2|, (Ai infinite, of course, J an ideal on K) and A € 
I* => |TIje4Xi/1| = A and o < reg, (J) then |] Ai/Z| > A? 
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Proof. Check (part (3): by the proof of 3.3(2) we can find A; € I* fore <o 
such that for finite u C 0, Neew Ae € J* and continue as in the proof of 3.6). 


CLAIM 3.8: If D C FE are filters on « then 


WT,..>/2! s IL... Pees as UN 4s /(D+(K\A))|+(2°/D) +o. 


We can replace 2"/D by |P| if P is a maximal subset of E such that 
A#ABeEP=(A\B)U(B\A)#4O mod D. 


Proof. Think. 


Lemma 6. |[];-,,\;/D| < (0"/D+hefp,o([],<, Ai))<" (see Definition 8(1)) 
provided that: 


6 > regg(D) (*) 


Remark 3.9A. (1) If @ = 6}, we can replace 6" /D by 0£/D. In general we 
can replace 0*/D by sup{]],.,, f(i)/D: f € 0" }. 

(2) If D satisfies the pcf-th above 6 (see 2.11(1A), 2.12(2) then by 3.5(3) 
we can use htcf* (sometime even htcf, see 3.10). But by 3.7(1) we can 
ignore the A; < 0, and when i < kK > \; > 6 we know that 1.5(*)(a) 
holds by 3.3(3). 


Proof. Let X = 0*/D + hefp 6([],<, Ai). Let for ¢ < 0, ue = : AMSlie. we = 
(0°/D + hefp Tle, i)'S!, clearly ue = rae Let y = (sup;e, Ai)? and 
Ne < (A(x), €,<%) be such that ||Nel] = ue, NSIS) C Nc, A +1 Ne and 
{D, (0; $4 k) } e Ne and le< ¢ => Nz ~< Ne]. Let N = U{Ne LC < 6}. 
Let g* € [],2,,Ai and we shall find f € N such that g* = f mod D, this 
will suffice. We shall choose by induction on ¢ < 6, f€ (e < 3) and AS such 
that: 


i € — and fe E Ne. 


(d) i € AS C ri +1, JAS] < |C) 41, and (AS : ¢ < 6) is increasing 
continuous (in ¢). 


(ce) f a ) = min(A$ \ g*(i)); note: it is well defined as g*(i) < A; € AS 
(f) f — mod D 

(g) fi < fe < fe mod (D+ {i< nk: g*) 4 LE}). 

(h) f2@) © Ag 


So assume ra res is defined for every e < ¢. If ¢ = 0, let AS = {\;}, if ¢ 
limit AS = Ueec Aj, for ¢ =e +1, AS will be defined in stage ¢. So arriving 
to ¢, AS is well aaa and it belongs to N¢: for ¢ = 0 check, for ¢ =e+1, 
done in stage ¢, for ¢ limit it belongs to Nc as we have Ne C Ne and: 
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E<¢ => Ne < Ne and Af € Ne. Now use clause (e) to define f2/D. As 
(AS si <6) © Ng, |AS| < [Cl +1 < 6 and 6°/D <X<A+1C Ny, clearly 
[Tic [AS|/D] < A hence {f/D: f € T];e, 49} C Ne hence f2/D € Ne 
hence there is ft € N¢ such that f? € f2/D ie. clause (f) holds. As g* < f2 
clearly g* < ft mod D, let y§ = fi<n:g*(i)> fe}, yo a=ifi<n:id 
ys andcf (2 (i)) < 0} yS = 6 \ ys af ys. So (y$ : e < 3) is a partition of « and 
g* < ft mod (D+ y§) for e =1, 2. 
Let yg ={i<k: cf(f2(z)) = 0} so ft € Nc, and 6 = N¢ hence ys E Ne, 
so (Ts, fei), <p+y§) € Ne. Clearly ys Cc ys Cc ys U ys. Now 
f(T... K (@), <p4y§) Ss hefp. yal]... Ai) 
< ae Cc 
< hefpo(]]._. Nu) CAKAFION, 


hence there is F € Ng, |F| < A, F C TTicys fé(i) such that: 


(va)ig¢ [] RO > GFE F\g<f mod (D+y§))] 
icy§ 
As\+1C N necessarily F C Ne. Apply the property of F to (g* | yS) U 
Oue\y$) and get fe € FC Ne such that g* < fi mod (D+ yS). Now use 
similarly [],—,, cf(f2(#))/(D + (« \ yf) < |0"/D| < A; by the proof of 3.7(1) 
there is a function ff € No NT] ,<, fé(i) such that g* | (ys + y$) < f2 
mod D. Let A$* be: AST! U {f2W}. 

It is easy to check clauses (g), (h). So we have carried the definition. Let 

Xe arfic nw: fy@) < PW}. 
Note that by the choice of f2, f?,, we know X¢ = yS Uy$ mod D, if this 
last set is not D-positive then g* > f? mod D, hence g*/D = f?/D € Ne, 
contradiction, so y$ U y$ 4 mod D hence X¢ € Dt. Also ((y§ Uy$)/D : 
¢ < 6) is C-decreasing hence (X;¢/D:¢ < 6) is C-decreasing. 

Also if i € X¢ 1 X¢, and G1 < then f2(i) < F244, < £2 (4) (first 
inequality: as Ao ce A? and clause (e) above, second inequality by the 
definition of X¢,, hence for each ordinal i the set {¢ < 6: % © X¢} is finite). 
So 6 < regg(D), contradiction to the assumption (*). 3.9 


Note we can conclude 
CLAIM 3.9B: 


Tick 4i/D = sup{(T]ic, f()) 7820 + hefd, Tica Ai) 88): Dy 
is a filter on & extending D such that 
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Ae Dt = [J \i/(Di + 4) = [J i/Di 


1<kK t<K 
and f € 0", f(i) < Aj} 


Proof. The inequality > should be clear by 3.7(3). For the other direction 
let ps be the right side cardinality and let Dp = {k \A: AC « andif A € 
D* then [J,-,,.Ai/(D + A) < }, so we know by 3.7(2) that Dp is a filter on 
k extending D. If 0 € Do we are done so assume not. Now pp > 2"/D (by the 
term ([], f(i)/Do)<t82(2) so by 3.8 we have J],-,, \i/Do > p (use 3.8 with 
D, Do here corresponding to D, E there). Now the same holds for Dp + A 
for every A € Dj. Also AC BC wand AE Dj = J], 2, Ai/(Do + A) < 
[lc, Xi/(D1+B) so for some B € Dj , Di =: Do +B satisfies the requirement 
inside the definition of pu, so 4 > hef p, ([],<,, A1)<"°8? (D1). 

By 3.9 (see 3.9A(1)) we get a contradiction. 3.9B 


Next we deal with existence of <p-eub. 
CLAIM 3.10: 


(1) Assume D a filter on k,g% € “Ord for a < 6,g* = (gi : a < 6) is 
<p-increasing, and 


cf(6) > 0 > reg, (D). (x) 
Then at least one of the following holds: 


(A) (gX : a <6) has a <p-eub g € “Ord; moreover @ < lim inf p (cf[g(i)] : 
i<k) 

(B) ef(6) = reg, (D) 

(C) For some club C of 6 and some 6; < 6 and y; < @/ and w; C Ord 
of order type 7 for i < «, there are fy € [],-,, wi (for a € C) such 
that fo(t) = min(w; \ gi (4)) and a € C&B € C&a < B= fa <p 
fako~fo =p fok-fa Sp gnkegt, < fa: 


(2) In (C) above if for simplicity D is an ultrafilter we can find w; C Ord, 
otp(wi) = Vi, (awe : € < cf(d)) increasing continuous with limit 6, and 
he €]],<,, wi such that fo, <p he <p fa,,, moreover, A,.,, Yi <w. 


Proof. (1) Let o = reg,(D). We try to choose by induction on ¢ < 0, 9¢, fac 
(for a < 6), AS, a¢ such that 


(a) AS = (AS :i<k). 

(b) AP = {fae,e(#), ge(8) © < GU {[supacs 9%,(8)] + 1}- 

) fa.c(é) = min(A$ \ g*(i)) (and fac € “Ord, of course). 

) ag is the first a, Yecc, We < a < 6 such that [3 € [a,d) > fac = fac 
mod D] if there is one. 

€) gc < fac,¢ moreover g¢ < Maxi fa,.c, lef but for noa < O we have 

(¢) 9¢ S fac.c ¢ foc.¢r 1s} but fi 6 do we h 

ge < max{g%,1} mod D. 


The PCF Theorem Revisited 475 


Let ¢* be the first for which they are not defined (so ¢* < a). Note 
e<&<Ck&agsa<db= foe =v fac& fae < fae& fag AD fae (*) 


[Why last phrase? applying clause (e) above, second phrase with a, € here 
standing for a, ¢ there we get Ap =: {i < « : max{g*(i),1} < g-(i)} € Dt 
and applying clause (e) above first phrase with ¢ here standing for ¢ there 
we get Ay = {ti < &: ge(t) < fo e(t) or ge(t) = 0 = fa e(i)} € D, hence 
ApN A, € D*, and g.(i) > 0 for i € AgM Ai (even for i € Ag). Also by clause 
(c) above g*(i) < ge(i) > fae(t) < ge(i). Now by the last two sentences 
i € Ag Ar > gilt) < Gel) < fare’) > fant) < Gelt) < fare(i), together 
fos #D See as required] 


Case A. ¢* =o and Upc, a¢ < 6. Let a(x) = Ucc, ae, for ¢ < o let 
yo={t< Kk: fos clit) # focs)c+i()} #4 9 mod D. Now for 
i<k, (fo(x),c(4) : ¢ < @) is non increasing so 7 belongs to finitely 
many yc’s only, so (ye : ¢ < a) contradict o > reg, (D). 

Case B. (* =o and Ure, a¢ = 46. So possibility (B) of Claim 3.10 holds. 

Case C. C* <a. 


Still AS (i < K), face (a < 5) are well defined. 


Subcase C1. ag» cannot be defined. 

Then possibility C of 3.10 holds (use w; =: AS’, fa 7] esc eee Je 

Subcase C2. ac can be defined. 

Then fo,.,¢* is a <p-eub of (gi : a < 6) as otherwise there is ge as 
required in clause (e). Now Fat c* is almost as required in possibility (A) 
of Claim 3.10 only the second phrase is missing. If for no 61 < 0,{i<K: 
cf [fac ¢* (4)] < 61} € D*, then possibility (A) holds. 


So assume 6; < @ and B =: {i < &: No < cf[fa,.,c« (t)] < O1} belongs to 
D*, we shall try to prove that possibility (C) holds, thus finishing. Now we 
choose w; for i < K: for i € « we let w? =: { fa,«,c+ (i), [SuPacs 9%, ()] + 1}, for 
i € B let w;} be an unbounded subset of fo,.,¢* (i) of order type cf[fa.« c+ (2)] 
and fori € «\ B let wi = 0, lastly let w; = w? Uw}, so |wi| < 61 as required 
in possibility (C). Define fy € “Ord by fa(i) = min(w; \g%(z)) (by the choice 
of w? it is well defined). So (fa : a < 6) is <p-increasing; if for some a* < 6, 
for every a € [a*,d) we have fa/D = fa«/D, we could define ge» € “Ord 
by: 

ge | B= fox (which is < face cx); 

ge | (K\ B) = 0,\B 

Now ge» is as required in clause (e) so we get contradiction to the choice 
of ¢*. So there is no a* < 6 as above so for some club C of 6 we have 
a<6€CS= fo #p fg, so we have actually proved possibility (C). 


(2) Easy (for A; 7% < w, wlog 6 = reg,(D) but reg,(D) = reg(D) so 6; = 
reg(D)). 3.10 


CLAIM 3.11: 
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(1) In 3.10(1), if AX = 6 = cf(A), g* obeys G@ (@ as in 2.1), @ a 0-weak 
(S, @)-continuity condition, S C A unbounded, then clause (C) of 3.10 
implies: (C)’ there are 6; < reg,(D) and A- € D* for « < @ such 
that the intersection of any 0/ of the sets A. is empty (equivalently 
i<«K=> (AS"¢e)[i € Az] (reminds (o, 6 )-regularity of ultrafilters). 

(2) We can in 3.10(1) weaken the assumption (*) to («)/ below if in the 
conclusion we weaken clause (A) to (A)! where 


(«)’ cf(6) > 6 > reg(D) 
(A)! There is a <p-upper bound f of {g* : a < 6} such that no f’ <p f 
(of course f’ € “Ord) is a <p-upper bound of {g% : a < 6} and 
@ < lim inf p (cf[f (2)] : i < &) 
(3) If g* € “Ord, (g% : a < 6) is <p-increasing and f € “Ord satisfies (A)! 
above and 
(*)” cf(d) > wsat(D) and for some A € D for every i < k, cf(f(i)) = 


wsat(D) then for some B € D* we have [],_,. cf[f(2)]/(D + B) has 
true cofinality cf(d). 


— 


1<k 


Remark. Compare with 2.6. 


Proof. (1) By the choice of @ = (ag : a < A) as C (in clause (c) of 3.11(1)) 
is a club of 4, we can find 8 < X such that letting (a, : € < 6) list {a € 
ag : otp(aNag) < 6} (or just a subset of it) we have (a2, ae41) NC 4 0. 
Let ye € (Qe, Qe41) NC, and € € (Qe,Qe41) be such that {a¢ : 
¢ < e} C ae, and as we can use (a2, : € < @), wlog & < ye. For 
C< Ole Be = {i < we: fat) < fat) < fx) < foesild) and 
sup{ fa, (7) + 1 : Eg < ¢} < sup{ fa, (7) + 1 : & < ¢+ ine 
(2) In the proof of 3.10 we replace clause (e) by 
(e") ge < fac,¢ and for a <6 we have fa < ge mod D 
(3) By 1.8(1) 3.11 


CLAIM 3.12: 


(1) Assume \ = tef([] A/D) and p = cf(pu) < A then there is N <prA,rNV a 
sequence of regular cardinals and ps = tef([] X’/D) provided that 


uu > reg,(D),min(A) > reg?” (D) whenever o < reg, (D) (x) 
(2) Let I* be the ideal dual to D, and assume (*) above. If (*)(@) of 1.5 
holds and yz is semi-normal (for (A, /*)) then it is normal. 


Proof. Part (2) follows from part (1) by 2.2(3). Let us prove (1). 


Case 1. js < liminfp(A) 
We let 


fe Bw ifu<; 
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and we are done. 

Case 2. lim inf +D(A) > 6 > reg,(D), uw > 0, and (Vo < reg, (D))[reg?(D) 
< OJ. 

Let 6 =: reg,(D). There is an unbounded S C yw and an (S,6@)-continuity 
system G@ (see 2.4). As [] A/D has true cofinality A, \ > pz clearly there are 
g*, € [JA for a < pw such that 9* = (g* : a < py) obeys G@ for A*[A] (exists 
as 0 < liminf p(A)). 

Now if in claim 3.10(1) for g* possibility (A) holds, we are done. By 3.11(1) 
we get that for some o < reg, (D) we have reg? (I) > ys, contradiction. 
Case 3. liminfp(A) > 0 > reg,(D), uw > 0, and (Vo < reg, (D))[reg?(D) 
< 6]. 

Like the proof of [8, Ch. II 1.5B] using the silly square. 3.12 


* * * 


We turn to other measures of [] A/D. 
Definition 9. (a) T(A) = sup{|F|: F CTIA and fi 4 fo € F => fi Ad 


Jats: 
(b) T5(A) = min{|F| : 


(i) F CTIA 
(i) f#Af,€ F > hi Fp fe 
(iit) F maximal under (i)+(ii)} 


(c) TZ(A) = min{|F| : F CTIA and for every f, € TA, for some fo € F 
we have -f, Ap fo}. _ 7 - 

(d) FTSZO)=TLA) = TAO) thendes PHO) =TL (A) for <3: 

(e) For f € "Ord and £ <3 let Th(f) means Th((f(a) :a<&)). 


Theorem 1. (0) If Do © Dy are filters on « then Th, (A) < Th, (A) for 
£=0,2. Also if K = Ap U Ai, Ao € Dt, and Ay € D+ then Th(X) = 
min {Ts 4 (A), Tea, A)} ford =0,2. 

(1) htef p(T] A) < TQ) < TAA) < TBO) 

(2) If T3(A) > |P(«)/D| or just T3(A) > p, and P(K)/D satisfies the 
ut-c.c. then T2(\) = THA) = TA(A) so the supremum in 3.13(a) is 
obtained (so e.g. T9(A) > 2" suffice) 

(3) T2(A)<te8? = T2(X) (each rj infinite of course). 

(4) UhtelD Tic, fl < TRA) < [htet [jen FO] + reg(D)*/D where 
6 = reg,(D) in fact 0 = reg(D) + wsat(D) suffice 

(5) If D is an ultrafilter | [] \/D| = T(A) for e < 2. 

(6) Tn (4), f Nucg F(8) > 2 (or just (reg(D) +2)"/D < minece f(i)), then 
inten Tien SWISS" <TR(f) _ 

(7) If the sup in the definition of T9(A) is not obtained then it has cofinality 
> reg(D) and even is regular. 


Proof. (0) Check. 
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(1) First assume ps =: T2,(A) < htcfp([] A); then we can find p* = cf(p*) € 


Na 


(u, htcfp([] )] and fi = (ui : i < &), a sequence of regular cardinals, 
Nien Hi < Ai Such that w* = tef([] A/D) and let (fa : a < u*) exemplify 
this. Now let F exemplify w = T3(A), for each g € F let 


’ wal fay g(t) if g(t) < pi 

u's Ue Deena i; otherwise. 
So there is a(g) < y* such that g’ <p fag). Let a* = sup{a(g) : g € F}, 
now a* < u* (as uw* = cf w* > w= |Fl). Sogeé F > g Fp fas, 
contradiction. So really T2,(A) < htefp([] \) as required. 

If F exemplifies the value of T}}(A), it also exemplifies T7,(A) < |F| 
hence T3,(A) < Th()). 

Lastly if F exemplifies the value of T}(f) it also exemplifies T),(A) > 
IPI, so TEA) < T3(D). 
Let pu be |P(K P| or at least ys is such that the Boolean algebra P(«)/D 
satisfies the u+-c.c. Assume that the desired conclusion fails so T7,(A) < 
T2(A), so there is Fy C J] A, such that [f: 4 fe € Fo > fi AD fl, and 
|Fo| > TZ(A) + p (by the definition of TP(A)). Also there is Fy C [JA 
diempliniae the value of T2,(A). For every f € Fo there is gf € F2 such 
that ~f Ap gf (by the choice of Fy). As |Fo| > T2(A) + pu for some 
g € Fo, F* =: {f € Fo: gf = 9} has cardinality > T2,(f) + u. Now for 
each f € F* let Ay = {i < &: f(i) = g(i)}, clearly Ay € Dt. Now 
ft A;/D is a function from F* into P(k)/D, hence, if u = |P(K)/DI, 
it is not one to one (by cardinality consideration) so for some f’ 4 f” 
from F* (hence form Fo) we have Ay /D = Af” /D; but so 


fi< nif =f"O}2 fis ws fWM =9O} Ni <n: fH") = 9H} 
= Aj /D 


hence is 4 @ mod D, so =f’ #p f”, contradiction the choice of Fp. If 
pu A |P(K)/D| (as F* C Fo by the choice of Fo) we have : 


hA#Afe => Ap, nAp =O mod D 


so {Ay : f € F*} contradicts “the Boolean algebra P(k)/D satisfies the 
+ ” 
pit-c.c.”. 


(3) Assume that 0 < reg(D) and! p <+ TR(A). As w <+ TR(A) we can 


find fa € [[\ for a < p such that [a < B > fa 4p fe]. Also (as 
6 < reg(D)) we can find { A; : ¢ < 0} C D such that for every i < « the 
set w; =: {e < 0:1 € A-} is finite. Now for every function h : 6 + fu we 
define gn, a function with domain xk: 


I<* means here that the right side is a supremum, right bigger than the left or 
equal but the supremum is obtained 
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wa 


gn(t) = {(€, frre (@)) : € € wi} 


So [{gn(i) :h © °u}| < (Ai)!@! = Aj, and if hy ¥ hg are from °y then for 


some € < 0, hi(e) # ho(e) so Bay ny = {4: frye) (4) F fno(e(in)} € D 
that is Bhy ho nN Az € D so 


@, fie Bhy ho MA; then € € Wj, SO Yh, (2) # Gho(t). 
@2 BrijngNAe € D 


So (gn : h € %) exemplifies T3(A) > uw’. If the supremum in the 
definition of T}(A) is obtained we are done. If not then T(A) is a limit 
cardinal, and by the proof above: 


[e< TBO) & O<reg(D) => p? < T3(A)]. 


So if T3(A) has cofinality > reg(D) we are done; otherwise let it be 
Deco Me With we < TP(A) and 0 < regD. Note that by the previous 
sentence TR(\)® = TR(A)<t) = T] cg Me, and let {f§ : a < pe} C 
[]A be such that [a < 6 = f5 #p fs] and repeat the previous proof 
with Five) replacing fre). 

For the first inequality. Assume it fails so p =: T3,(f) < htefp([],-,, f(@) 
hence for some g € [];<¢q)(f(@) + 1), tcf ]:<,, 9), <p) is with 
A = cf(A) > pw. Let (fa : a < X) exemplifies this. Let F be as in 
the definition of T7,(f), now for each h € F, there is a(h) < \ such that 


{i<n«: if h(i) < g(i) then h(i) < fog) (i)} € D. 


Let a* = sup{a(h) +1:h € F}, now fox € [],-, fi) and he Fs 
h#p fax contradicting the choice of F’. 


For the second inequality. Repeat the proof of 3.9 except that here we 
prove F =: U-eg(Ne OM c, f(i)) exemplifies T5(f) < A. So let g* € 
I],<,, Ai, and we should find f € N such that (g* #p f); we replace 
clause (g) in the proof by (g)’g* < f2,, < f? mod D 

the construction is for ¢ < reg(D) and if we are stuck in ¢ then 
—f¢ #p g* and so we are done. 


(5) Straightforward. 
(6) Note that all those cardinals are > 2" and 2" > reg(D)*/D. Now write 


(7) See proof of part (3). Moreover, let ps = S> 


successively inequalities from (2), (4), (1) and (3): 


TR(f) = Taf) < [htefp T] FQ)" < (TRIS = TB(f)- 
t<K 

ext MesT < TRA), Me < TBA) 

as exemplified by {fe :¢ <7}, {f& : a < pe} respectively. Let ga be: if 


eet He Sa< waz Me then ga(i) = (fe(é), fad). So {ga a < pu} 


show: if T}(A) is singular then the supremum is obtained. 3.14 
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Claim 1. Assume D is a filter on k, f € “Ord, u®° = pw and 2" < p,Tpd(f), 
(see Definition 9(d) and Theorem 1(2)) and reg, (D) = reg(D). If u < Tpo(f) 
then for some sequence \ < f of regulars, w+ = tcf([] A/D), or at least 


(*) There are (Ain ia < ni): i <#), Ain = C£(Ain) < f(d) and a filter D* 
on Uren {i} x ni such that: pt = tef(T] i; n) Ain/D*) and D={ACk: 
Useatit x mi € D*f. 

Also the inverse is true. 


Remark 3.15A. 


(1) It is not clear whether the first possibility may fail. We have explained 
earlier the doubtful role of p®° = p. 

(2) We can replace + by any regular yz such that /\ 
we use 3.14(4) to get uw <t Tp/(f). 

(3) The assumption 2" < yw can be omitted. 


acy lal’? <p and then 


Proof. The inverse should be clear (as in the proof of 3.6, by 3.14(3)). 
Wlog f(i) > 2” for i < «, and trivially (reg(D))*/D < 2", so by 3.14(4) 


Tp(f) < [atefo([[ F@)e. 
t<K 

If u < htcfp([],.,, f(2)) we are done (by 3.12(1)), so assume htcf p([];-,, f(¢)) 
< ps, but we have assumed pp < Tp(f) so by 3.14(4) as reg, (D) = reg(D) we 
have p<tes(P) > pt. Let y < ps be minimal such that Vo<reg(D) x? > pw, and 
let 6 =: cf(x) so, as pp > 2" we know yx = y<re8() = p<tes(P) > wt, y > 
2 Nady la|<ree) < y. By the assumption w = p®° we know @ > No (of 
course @ is regular). By [8, VIII 1.6(2), IX 3.5] and [Sh513, 6.12] there is a 
strictly increasing sequence (jie : € < 6) of regular cardinals with limit y such 
thet ge = tel [esate Jy"): 

As clearly x < htcfp([],., f(¢)), by 2-12(1) there is for each « < 8, 
a sequence AX = (Af : i < «) such that A§ = cf(AF) < f(z), and 
tef([],<,,A¢/D) = Me, also wlog A; > 2". Let (Ae : € < 0) exemplify 
6 < reg(D) and nj = [{e < 6:4 © A-}| and {A\in : n < w} enumerate 
{Az : € satisfies i € A-}, so we have gotten (*). 3.15 


4.4. Conclusion 


Suppose D is an Ni-complete filter on « and reg,(D) = reg(D). If A; => 2" 
for i < « and supyep, Tp+a(d) > pw then for some A; = cf(X) < Ai we 
have 


sup htcf X,) 
ae pe D+A ({[) 


U<K 
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4.5. Conclusion 


Let D be an Ny-complete filter on « and reg,(D) = reg(D). If for 7 < x, B; 
is a Boolean algebra and ; < Depth*(B;) (see below) and 


ae <p < sup Tpya(d) 
AcD+ 


then p+ < Depth* ([],-, Bi/D) 


t<K 
Proof. Use 3.25 below and 3.16 above. 


Definition 10. For a partial order P (e.g. a Boolean algebra) let Depth* (P) 
=min{A: we cannot find aa € P for a < XA such that a < 8B > dg <p az}. 


4.6. Discussion 


(1) We conjecture that in 3.16 (and 3.17) the assumption “D is No-complete” 
can be omitted. See [16]. 

(2) Note that our results are for ~ = p®° only; to remove this we need 
first to improve the theorem on pp = cov (i.e. to prove cf(A) = No < 
\ => pp(dA) = cov(A,A,%1,2) (or sup{pp() : cfu = Xo < pp < Af = 
cf(S<x, (A), C) (see [8], [14], §1]), which seems to me a very serious open 
problem (see [8, Analitic guide, 14]). 

(3) In 3.17, if we can find fo € [[,<,, 4 fora<A:la<B<A=> fa < fe 
mod D] and =f, =p fo+1 then \ < Deptht([],-,, Bi/D). But this do 
es not help for \ regular > 2". 

(4) We can approach 3.15 differently, by 3.20-3.23 below. 


CLAIM 3.20: If 2?” < yp < Tp(A), (or at least 2!PIt* < yu < Tp(A)) and 
u<* = p, then for some 6-complete filter E C D we have Tg(X) > p. 


t<K 


Proof. Wlog @ is regular (as p<? = p &,cf(0) < 6 => ws? = yp). Let 
{fa : a < wt} CTJA, be such that [a < 8 > fa #D fal. We choose by 
induction on ¢,a¢ < put as follows: a¢ is the minimal ordinal a < pt such 
that Ee. C D where E¢,. = the 6-complete filter generated by 


{{i<K: fac (i) # falt)} se < Ch 


(note: each generator of E¢,._ is in D but not necessarily E¢,_ C D!). 

Let ac be well defined if ¢ < ¢*, clearly e < € > ae < ac. Nowif C* < pt, 
then clearly a* = Ucee. ag < p> and for every a € (a*,u*), Eo £ D, 
so for every such a there are Ay € D* and aq € [¢*]<® such that Ag = 
Usea, tt < @: fas (i) = fa(t)}. But for every A € D*, a € [¢*]<® we have 


fa: ee (0,0), 4a = Amey =a} Cfa2 fet pAc]T.. Afae (i ):€ € ag}}, 


hence has cardinality < 6° < 2" < p. Also |[¢*]<°| < w<° < wt,|Dt| <2" < 
EL” so we get easy contradiction. 
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So ¢* = y+, but the number of possible E’s is < 2?” , hence for some E we 
have |{e < pt : E.o, = E}| =u. Necessarily E C D and E is 6-complete, 


and { fa. :é< pt, and Ey, = E} exemplifies Tg(\) > 1, so F is as required. 
3.20 


4.7. Fact 


1. In 3.20 we can replace p+ by p* if 2?” < cf(u*) < w* < T8(A) and 
Pray hel? ee ei 


Proof. The same proof as 3.20. 
CLAIM 3.22: 


(1) If 2" < |[JA/D|, D an ultrafilter on «, w = cf(u) < |] A/DI, Aje, lél®? 
<p, and D is regular then ys < Depth* ([];-,, 4i/D) 
(2) Similarly for D just a filter but A €¢ Dt > []A/(D + A) =J] A/D. 


Proof. 


(1) Wlog A =: limp X = sup(\) , so | []A/D| = * (see 3.6, by [1]). If 
Lu < \ we are done; otherwise let ~ = min{y : x” = A*}, so FOO = 
A”, cf(v) < K but A < pw < A* hence A®° < p hence cf(x) > No, also by 
x’s minimality A. |i] < iJ" < x, and remember y < p = cf p< 
xX so by [8,. VIII 1.6(2)] there is (ue : € < cf(x)) strictly increasing 
sequence of regular cardinals with limit y, [[-<ce(y) He/ Je has true 
cofinality uw. Let ye = sup{uc : ¢ < e} + 2%, let i: K => cf(x) be 
a(t) = sup{e +1: i => xe}. If there is a function h € []J;_,, i(¢) such 
that Ajcct(yy{t < & : Ali) < j} = 0 mod D then [J], .,, Hnyiy/D has 
true cofinality 4 as required; if not (D,i) is weakly normal (i.e. there 
is no such h- see [13]). But for D regular, D is cf(y)-regular, some 
(Ae: € < cf(K)) exemplifies it and h(i) = max{e: € < i(i) andi € A,} 
(maximum over a finite set) is as required. 


Similarly using A =: lim infp (A). 3.22 


— 
i) 
—S 


4.8. Discussion 


1. In 3.20 (or 3.21) we can apply [12, §6] so w = tef([]U 
D={ACK:Ujey v € D*} and each aj, is finite. 


icp %i/D*), where 


See also in 3.15. 

CLAIM 3.24: If D is a filter on «, B; is the interval Boolean algebra on 
the ordinal a;, and |[];-,,ai/D| > 2" then for regular 4 we have: yw < 
Depth* ([],-,, Bi/D) iff for some py; < a; (for i < K) and A € D*, the true 
cofinality of [],—,, ui/(D + A) is well defined and equal to p. 
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Proof. The <= (i.e. if direction) is clear. For the > direction assume su 
is regular < Depth ([],-, Bi/D) so there are fa € J];2, Bi, such that 
Tle, Bi/D & fa/D < fe/D for a < 8p. 

Wlog pp > 2". Let fo(t) = Ulia,i,2e, Ja,i.2e41) where ja,ie <Jajie+1 < 
for € < 2n(a,i). As w = cf(u) > 2" wlog nai = ni. By [14], 6.60] (see 
more [Sh513, 6.1]) we can find A C A* =: {(i,0) : i < Ke < 2nqg} and 
(Yigit <4, £ < 2n;) such that (i,?) € A= 7, is a limit ordinal and 


i<k 


(*) For every f € [(;2¢4 Vig and a < p there is B € (a, p) such that 
(4,£)€ AX A> joie = Vie 
(1,0) € A= f(t,2) < jase < Vie 
(7,£) € A= cf(7Z{~) > 2” 


Let (4) = max{? < 2n(i) : (i, 2) © A} and let B = {i: ¢(i) well defined}. 
Clearly B € D* (otherwise we can find a < 6 < «such that fa/D = fg/D, 
contradiction). For (i,£) € A define 67, by BF, = sup{Yjm +1: (j,m) € A* 
and Ym < Vie}. Now Bi, < yj, as cf(yf~) > 2°. Let 


Y={a<u: if (7,0) ¢ A*\A thenjajie = Vie 
and if (1,2) € A then BF _ < ja,ei < V0} 


Let B, = {i € B: &(t) is odd}. Clearly By C B and B\ Bij = § mod D 
(otherwise as in (*)1, (*)2 below get contradiction) hence By € Dt. Now 
(*), for a < 6 from Y we have 


(Jo,i,e(2) 77 © Bi) S iesea) :4€ Bi) mod (D} Bi) 
[Why? as fo/D was non decreasing in [];_,, Bi/D| 
(*)2 for every a € Y for some 8,a <8 € Y we have 
data :4€ By) < (98 ,i,€(a) :4€ By) mod (D t By) 


[Why? by (*) above] 

Together for some unbounded Z CY, ((jo,iee@) 2 t € Bi)/(D | Bi) : 
a € Z) is <pj;p,-increasing, so it has a <(prB,)-eub (as pw > 2” see 3.10, 
and more in [8, IT §1]), say (jf : i € Bi) hence [],¢p, j7/(D | Bi) has true 
cofinality y by 1.3(12) and clearly 7* < Vie) < a;, so we have finished. 013 24 


CLAIM 3.25 If D is a filter on «, B; a Boolean algebra, \; < Deptht(B;) 
then 


(a) Depth([],-,, B/D) >: supacp+ tef([],<,, A4(D + A)) (ie. on the cases 
tcf is well defined). 
(b) Depth* ([];-,, B/D) is >: Depth* (P(«)/D) and is at least 


sup{[tef(] ] ;/(D +.A))]* +X; < Depth*(B;), A € D*}. 


1<k 
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Proof. Check. 

CLAIM 3.26: Let D be a filter on &, (A; : i < &) a sequence of cardinals 
and 2" < uy =cf(p). Then (a) (8) > (7) => (0), and if (V, < p)(o*%° < p) 
and reg,(D) = reg(D) we also have (y) = (6) where 


(a) If B; is a Boolean algebra, \; < Depth*(B;) then  < Depth*([] 


i<K 
B;/D) 
(8) There are pw; = cf(u;) < A; for i < « and A € D® such that p = 
tef([] wi/(D + A)) 


(y) There are (Ain 22 < ni) 2 i <6), Xn = C£(Ain) < Ai, A* € Dt anda 
filter D* on U,-,, {i} X ni such that: 


b= tef(] Xin /D*) and D+ A* ={ACk: the set |) {a} x 4 
(i,n) ic A 
belongs to D*}. 
(5) For some A€ Dt , w< To+a((i 22 < &)) 


Remark. So the question whether (a) = (6) assuming (V, < p)(o*° < p) is 
equivalent to (6) < (6) which is a “pure” pcf problem. 


Proof. Note (y) = (6) is easy (as in 3.15, ie. as in the proof of 3.6, only 
easier). Now (3) = (7) is trivial and (G) = (a) by 3.25. Next (a) => (8) 
holds as we can use (a) for B; =: the interval Boolean algebra of the order 
A; and use 3.24. Lastly assume (Vo < )(o*° < ys) and reg,(D) = reg(D), 
now (y) > (d) by 3.15. 3.26 


Discussion. We would like to have (letting B; denote Boolean algebra) 
Depth (| J B;/D) > |] Depth (B;)/D 
1<K 1<K 
if D is just filter we should use 7p and so by the problem of attainment 
(serious by Magidor and Shelah [15]), we ask 


® For D an ultrafilter on K, does \; < Depth (B;) for i < « implies 
[[\:/2 < Depth* ([] B/D) 
t<K t<K 
at least when A; > 2" ; 
@! For D a filter on &, does \; < Depth*(B;) for i < « implies, assuming 
A; > 2" for simplicity, 
Tp((Ai i < &)) < Depth* ([] Bi/D) 
1<kh 


As explained in 3.26 this is a pef problem. 
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In [16] we deal with this under reasonable assumption (e.g. fp = x* and 
x = x*°). We also deal with a variant, changing the invariant (closing under 
homomorphisms, see [6]). 


5. Remarks on the Conditions for the pcf Analysis 


We consider a generalization whose interest is not so clear. 

CLAIM 4.1: Suppose \ = (A; : i < «) is a sequence of regular cardinals, 
and @ is a cardinal and J* is an ideal on «; and 4H is a function with domain 
k. We consider the following statements: 


(**) liminfy-(A) > 6 > wsat(I*) and H is a function from « to P(6) 
such that: 


(a) For every ¢ < 0 we have {i <&K:¢ € H(i)} =« mod I* 
(b) For i < & we have otp(H(i)) < A; or at least {i < «: |H(i)| > 
ri} Ee I* 


(x*)* Similarly but 
(b)+ For i < « we have otp(H(i)) < A; 


(1) In 1.5 we can replace the assumption (*) by (**) 7 above. 

(2) Also in 1.6, 1.7, 1.8, 1.9, 1.10, 1.11 we can replace 1.5(«) by (**)H. | 

(3) Suppose in Definition 4(2) we say f obeys a@ for H (instead of for Ax) 
if 


(i) For 8 € ag, such that ¢ =: otp(aa) < 0 we have 
otp(as), otp(as) € H(t) > falt) < fai) 


and in 2.3(2A), fa(z) = sup{fa(2) : 8 € aq and otp(ag), otp(aa) € 
A(i)}. 


Then we can replace 1.5(*) by (**)q in 2.5, 2.5A , 2.6; and replace 1.5(«) 
by 


(«x)f, In 2.7 (with the natural changes). 


Proof. (1) Like the proof of 1.5, but defining the g-’s by induction on € we 
change requirement (ii) to 


(ii)’ If ¢-, and {¢,e} C H(i) then ge(z) < g-(t). 
We can not succeed as 
(Bay \ Bay) fi <wieet+1e H@}:6 <6) 


is a sequence of 6 pairwise disjoint member of (/*)?. 


486 Saharan Shelah 


In the induction, for ¢ limit let g-(¢) < Uf{ge(t) : ¢ € H(i) and 
é € H(t)} (so this is a union at most otp(H(i) Ne) but only when 
e € H(t) hence is < otp(H(i)) < Ai). 

(2) The proof of 1.6 is the same, in the proof of 1.7 we again replace (ii) by 
(ii)’. Also the proof of the rest is the same. 

(3) Left to the reader. 41 


We want to see how much weakening (*) of 1.5 10 “liminfy.(A) > 6 = 
wsat(I*) suffices. If 6 singular or liminfy«(A) > @ or just ([](A), <r) is 67- 
directed then case (3) of 1.5 applies. This explains (*) of 4.2 below. 

CLAIM 4.2: Suppose A = (Ay: 4 < &), Ai = C£(Aa), J* an ideal on & and 

lim inf 7(A) = 6 > wsat(J*), 6 regular (*) 

Then we can define a sequence J = (J¢ : ¢ < ¢(*)) and an ordinal 

¢(*) < OF such that 


(a) J is an increasing continuous sequence of ideals on kK. 

(b) Jo =I", Jeu =: {A : AC k,and: A © Je or we can find h: A 0 
such that A; > h(t) and e < 6 => {i: h(i) < ec} € Je} 

(c) For ¢ < G(x) and A € Je41\ Je, the pair ([] A, Je +(«\A)) (equivalently 
(T]A | A,Je | A)) satisfies condition 1.5(*) (case (8) hence its 
consequences, (in particular it satisfies the weak pcf-th for @). 

(d) If K ¢ Uccex) Je then ([] A, Uccc«) Je) has true cofinality 0. 


Proof. Straight. (We define Jc for ¢ < 0+ by clause (b) for ¢ = 0, ¢ successor 
and as U.-¢ Je for ¢ limit. Clause (c) holds by claim 4.4 below. It should be 
clear that Jg+41 = Jo+, and let C(*) = min{¢ : Jeq1 = Usec Je} 80 we are 
left with checking clause (d). If A € Tex) h € [],e4 Xi, choose by induction 
on¢ < 6,e(¢) < @ increasing with ¢ such that {i < « : h(i) € (e(¢),e(¢+1)) € 
J tay) If we succeed we contradict 6 > wsat(I*) as @ is regular. So for some 


¢ < 6,e(¢) is well defined but not e(¢ +1). As Jer) = Jecs)41, Clearly 
if 

{i<«: h(t) < e(Q)} =«& mod Jey). So ge(t) = ea nabigs exemplifies 
0 ife > vi 


tef(T] A/ Seas) = 8. 4.2 


Now: 


5.1. Conclusion 


Under the assumptions of 4.2, I* satisfies the pseudo pcf-th (see Defini- 
tion 6(4)). 

CLAIM 4.4: Under the assumption of 4.2, if J is an ideal on & extending 
I* the following conditions are equivalent 


(a) For some h € [] A, for every e < 0 we have {i€ A: h(i) <eleJ 
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(b) (TL A, <s+(n\4)) is 0*-directed. 
Proof. (a) > (b) 


Let fe € [JA for ¢ < 0, we define f* € [] A by 
f(a) = sup{ felt) +1: ¢ < Ai}. 
Now f*(z) < Ay as A(i) < A = cf(A;) and fe [A <7; f* [Aas {ie A: 
h(i)<¢}e J. 
(b) = (a) 
Let f¢ be the following function with domain «: 


aoe ¢ if C< A; 
1a) = {6 if C> Ay 


As liminfy. > 6, clearly ¢ < ¢ > fe <r fe and of course f¢ € I]. By 
our assumption (b) there is h € []A such that ¢<@=> fe [A<h 
mod J. Clearly h is as required. 4.4 
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Over a span of more than 60 years, Paul Erdés took the art of collaborative 
research in mathematics to heights never before achieved. In this brief look 
at his collaborative efforts, we will explore the breadth of Paul’s interests, the 
company he kept, and the influence of his collaboration in the mathematical 
community. Rather than focusing on the mathematical content of his work 
or the man himself, we will see what conclusions can be drawn by looking 
mainly at publication lists. Thus our approach will be mostly bibliographical, 
rather than either mathematical or biographical. The data come mainly from 
the bibliography in this present volume and records kept by Mathematical 
Reviews (MR) [15]. Additional useful sources of information include The 
DBLP Computer Science Bibliography (a database of articles in computer 
science) [3], Zentralblatt [19], the Jahrbuch [14], various necrological articles 
too numerous to list, and personal communications. Other articles on these 
topics can be found in [4, 5, 7, 8, 9, 10, 11, 16]. 

Paul became a legend during his long and productive life, and his 
fame (as well as his genius and eccentricity) spread beyond the circles of 
research mathematicians. We find a popular documentary film about him [2], 
articles in general circulation magazines [13, 18] (as well as in mathematical 
publications—see [1] for a wonderful example), two popular biographies 
[12, 17], and graffiti (e.g., his quotation that a mathematician is a device for 
turning coffee into theorems). But even within the academic (and corporate 
research) community, his style and output have created a lot of folklore. 

The reader is probably familiar with the concept of Frdés number, defined 
inductively as follows. Paul has Erdés number 0. For each n > 0, a person not 
yet assigned an Erdds number who has written a joint mathematical paper 
with a person having Erdés number n has Erdés number n + 1. Anyone 
who is not assigned an Erdds number by this process is said to have Erdos 
number oo. Thus a person’s Erdés number is just the distance from that 
person to Paul Erddés in the collaboration graph C,, in which two authors are 
joined by an edge if they have published joint research. For example, Albert 
Einstein has Erdos number 2, because he did not collaborate with Paul Erdés, 
but he did publish joint research with Ernst Straus, who was one of Paul’s 
major collaborators. Purists can argue over how to count papers with more 
than two authors, but here we will adopt the liberal attitude that each of 
the 9) pairs of authors in a k-author paper are adjacent in C. Of course one 
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need not restrict the collaboration graph to the field to mathematics (more 
about this below). 

A common variant is to give a person who has written p > 0 papers 
with Paul the Erdés number 1/p. Andras Sarkozy, with I and Andras 
Hajnal, with 36 have the smallest positive Erdés numbers under this 
definition, followed in order by Faudree, Schelp, Sd6s, Rousseau, Rényi, Turan, 
Szemerédi, Graham, Burr, Spencer, Pomerance, Simonovits, Pach, Straus, 
Nicolas, Nathanson, Rado, Bollobas, Milner, Gyarfads, Selfridge, Piranian, 
Hall, Chung, Jodé, Rédl, and Reddy, who all have a value under 0.1. (This list 
is based on data from 2010, but most of the other statistics in this paper are 
based on data from 1995.) 

The author maintains lists of coauthors of Paul Erdés and coauthors of 
these coauthors (i.e., all people with Erdés number not exceeding 2) and 
updates these lists periodically. They are available as part of the Erdds 
Number Project website [6]. (Difficulties in author identification, among other 
problems, surely make these lists and other data discussed in this article less 
than 100% accurate, but we believe that the number of errors is not large.) 

As was pointed out in [10], the average number of authors per research 
article in the mathematical sciences increased dramatically over Paul Erdés’s 
lifetime. (One can speculate whether his existence is part of the reason for 
this.) Specifically, the fraction of all authored items reviewed in MR having 
two or more authors has increased, as a function of time. While over 90 % of all 
papers in 1940 (when MR began) were the work of just one mathematician, 
today only about half of them are solo works. In the same period, the fraction 
of two-author papers has risen from under 10% to about one third. Also, in 
1940 there were virtually no papers with three authors, let alone four or more; 
now about 10% of all papers in the mathematical sciences have three or more 
authors, including about 2% with four or more. 
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Fig. 1 The number of authors on Paul Erdés’s papers over the years. 
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The same trend can be seen in Paul’s work, but with an even greater 
amount of collaboration. The graph in Fig. 1 shows the fraction of 1-, 2-, 3-, 
and > 4-author items in Paul’s publication list, year by year through 1995. 
(Almost all of these are research papers. The rest are books, articles about 
people, or other writings.) For reference, Fig.2 shows the absolute sizes we 
are talking about—the number of publications year by year. (The last two 
counts—for 1994 and 1995—are probably too low, because of incomplete 
publication data at the time the figures were constructed.) Cumulatively 
(Fig. 3), fewer than one third of Paul’s 1,400 works completed by 1995 were 
solo ventures. In fact, the mean number of authors (including Erdés) is almost 
exactly two. 
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Fig. 2 The number of papers by Paul Erdos over the years. 
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Fig. 3 The fraction of Paul Erdés’s papers with different numbers of authors. 


Paul’s mode of operation (dating from his departure from a permanent 
position at Notre Dame around 1954) was unique among mathematicians. 
Rather than staying at a home institution (research institute or academic 
department), he was constantly on the move, visiting mathematicians at 
conferences and research centers around the world. He often spent some 
summer months in Budapest, where he was a member of the Hungarian 
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Academy of Sciences and where he could work with several of his most prolific 
collaborators. Some of his favorite haunts included Memphis, Tennessee, 
the New York City area, and other places too numerous to list. He was 
a permanent fixture at the annual Southeastern Combinatorics Conference 
(in Boca Raton, Florida, or Baton Rouge, Louisiana) and other regular 
meetings in his various fields. For example, in the few months around the 
time this article was originally being written, Paul reported having been (or 
planning to go) at least to Atlanta, Memphis, three cities in Texas, New 
Jersey, New Haven, Baton Rouge, Colorado, France, Germany, Kalamazoo, 
and Pennsylvania, in that order. 

As he met and worked with ever-increasing numbers of people on his 
travels, it is not surprising that Paul added new coauthors every year since 
1936. (Only two—George Szekeres and Paél Turdn—published with him before 
that, in 1934.) Figure 4 shows how the cumulative number of coauthors 
increased, while Fig.5 shows the discrete time derivative of this function. 
Paul usually left the actual writing up of the papers to his collaborators— 
partly, he said, because he did not type. 
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Fig. 4 The cumulative number of Paul Erdés’s coauthors as a function of time. 


As the present collection shows, Paul Erdés’s papers span many branches 
of mathematics and exploit relationships among them. (One fine example of 
the latter is his application of probability to combinatorics.) Mathematical 
Reviews currently has about 60 broad subject classifications, ranging from 
“Mathematical Logic and Foundations” through “Information and Commu- 
nication, Circuits” (plus a section on history and biography), one of which it 
assigns to every item it records as its primary subject area. This list has varied 
slightly over time, with some categories being added or discarded as MR tries 
to keep up with current trends. Paul’s works, although often spanning two 
or more subject areas and therefore difficult to pinpoint into one category, 
have been given primary classification in about 40% of these subject areas 
or their equivalent predecessors. They include not only the two main areas of 
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Fig. 5 The number of new coauthors added each year. 


number theory and combinatorics, and substantial work in approximation 
theory, geometry, set theory, and probability theory, but also papers in 
mathematical logic and foundations, lattices and ordered algebraic structures, 
linear algebra, group theory, topological groups, polynomials, measure and 
integration, functions of a complex variable, finite differences and functional 
equations, sequences and series, Fourier analysis, functional analysis, general 
and algebraic topology, statistics, numerical analysis, computer science, and 
information theory. 


other 
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Fig. 6 Paul Erdés’s papers since 1979 by broad category. 


The chart in Fig.6 shows the fractions of Paul’s publications reviewed 
in MR from 1980 to 1995 in the various categories. Such a tabulation was 
easy to do, because the old-style 1/7 R review number included the category 
code as the first two digits following the colon. Figure 7 is a less accurate pie 
chart covering all years. Taken together, these graphics suggest a slight trend 
toward increased work in combinatorics (including graph theory) in the later 
years, with a comparable decline in the output in number theory. Indeed, 
nearly all Paul’s early papers were in number theory (61 of the first 64, by 
one count, covering the period 1932-1939). 
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Fig. 7 Paul Erdés’s papers by broad category (approximate). 


Since Erdés’s coauthors work in such varied fields, one would expect 
the set of people with Erdés number 2, 3, or a little higher to range over 
essentially all of mathematics. Indeed, this is the case. All Fields Medalists 
(through 2010) are in the Erdés component of the collaboration graph, with 
Erdés number at most 5. (The Erdés component of C' is just the connected 
component that contains Erdés.) This group includes people working in 
theoretical physics; for instance, there is the path Edward Witten—Sidney 
Coleman—Daniel J. Kleitman—Paul Erdos. Thus one can conjecture that many 
(if not most) physicists are also in the Erdés component, as are, therefore, 
many (or most) scientists in general. The large number of applications of 
graph theory and statistics to the social sciences might also lead one to 
suspect that many researchers in other academic areas are included as well. 

It is interesting to explore the publication lists (or at least the coauthor 
lists) of Erdés’s coauthors, to see how much collaboration went on after 
Paul left town. Let E, be the subgraph of C induced by people with Erdés 
number 1. According to data collected through 1995, Fy contains 458 vertices 
and 1,218 edges; thus an average Erdoés coauthor collaborated with over 5 
other Erdés coauthors. (The median, as opposed to the mean, of this statistic 
is only 3, however. Its standard deviation is about 6, and it takes values 
over 30 in four cases—Ron Graham, Frank Harary, Vojtéch Rodl, and Joel 
Spencer.) There are only 40 isolated vertices in E, (less than 9%), and three 
components with two vertices each. The remaining 412 vertices in E, induce 
a connected subgraph. Paul’s style seemed to rub off. 

Looking at it more broadly, we find that people with Erdés number 1 
have a mean of 20 other collaborators (median 15, standard deviation 22), 
and only six of them collaborated with no one except Erdés. As of 1995, 
five of them had over 100 coauthors (Frank Harary, Saharon Shelah, Ron 
Graham, Noga Alon, and Dan Kleitman). 

Another 4,546 people as of 1995 felt Paul’s influence second-hand—by 
doing joint research with one of the honored 458. (These numbers increased 
to 9,267 and 511, respectively, as of 2010.) Three quarters of the people with 
Erdés number 2 have only one coauthor with Erdés number 1 (i.e., each such 
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person has a unique path of length 2 to Erdés in C). However, their mean 
number of Erdés number 1 coauthors is 1.5 (standard deviation 1.2), and the 
count ranges as high as 13 (for Dwight Duffus and Linda Lesniak). 

The bibliography in the original edition of this volume listed about 1,400 
papers, but it was incomplete, especially with regard to works that appeared 
near the end of Paul’s life (and posthumously—a paper with Florian Luca 
and Carl Pomerance was published in 2008). An addendum is available on the 
Erdés Number Project website; the total as of 2010 stood at 1,525. Most of the 
papers published since 1939 appear in the Mathematical Reviews database. 
Reviews of Paul Erdés’s papers appeared in every volume that MR published 
during his lifetime, including a review of a joint paper with Tibor Gallai on 
page 1 of volume 1, written by George Polya. It is interesting to note that 
the second most prolific writer in the MR database (as of 1995) is Leonard 
Carlitz, with about 735 items. Carlitz had Erdés number 2 (via seven different 
coauthors) and wrote the MR review of several of Paul’s papers. In all, nearly 
500 different people reviewed Erdés’s papers for MR. Paul wrote for MR as 
well and had over 700 reviews to his credit. 

Readers with additions or corrections to any of the information in this 
article, the accompanying bibliography, or the Erdés Number Project website 
are urged to communicate with the author (grossman@oakland.edu). 
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Postscript 


It is fitting that Paul Erdés himself should have the last word. As is 
amply illustrated in this collection, Paul’s profound influence on so many 
mathematicians and fields of mathematics through his prolific research and 
incessant traveling is destined to leave a legacy that may never be equalled. 
Here then are some very special lines written by Paul for the Postscript of 
these volumes: 

“Determine or estimate as well as you can the number of solutions in 
positive integers of 


+ fees = 1, Uy < U2 <3 + Lp. 
XY v2 Lk 


Can the squares be decomposed as the finite union of sum-free sets (or Sidon) 
sequences? Apologies if this is trivial or trivially false. Let 71,...,2%,, be n 
points in the plane with at most t on a line (t fixed, n large). f(n;t) is defined 
to be size of the largest subset with no three on a line. We can only prove 


c 
caf? < f(nth< 


The lower bound is easy by the greedy algorithm. Fiiredi showed 


f(nt) > a(n) 


where g(n) — co slowly. What is the truth here? 

(Erdés—Turdan.) Let pnii — pn = dy. Prove that for infinitely many n, 
dyn > dni1 > drag or dn < dn41 < dn+2. 

This surely holds since if not, then there is an n for which 


dn = dn41,dn41 < dn42,dn42 > dn43,-.., ete. 
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I offer $100 for a proof and $25,000 for a counterexample. This of course 
is a joke since the conjecture surely holds. 

If I live I hope to have some more conjectures and even proofs. Will there 
be a celebration for my 90th birthday or only a meeting for my memory. May 
my theorems and problems live forever. 

My mother was very glad to read the many eulogies written for my 50th 
birthday. I am only sorry that my mother and father are not reading this 
volume (if you believe in survival after death then you can believe that 
perhaps they are reading it). Let me add another problem: In 1934 Turan and 
I proved (Amer. Math, Monthly 1934): Let a, < ag <... < dy be any set of 


n integers. Then the number of distinct prime factors of _ [[ (a; +;) is 
1<i<j<n 
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greater than clogn. It does not have to be greater than ;“~ (trivially). Try 


to improve both the upper and lower bounds. I offer 500 dollars for this.” 
KK 


The preceding lines are some of the last lines written by Paul Erdés. They 
not only represent a fitting conclusion to these volumes but they capture 
Paul’s style and his vision of life as a scholar, a style which influenced us all 
from around the world, a world which will not be the same without him. 


